PURIFIED THERMOSTABLE PYROCOCCUS FURIOSUS
DNA POLYMERASE I

Technical Field 
The present invention relates to a thermostable
enzyme having DNA polymerase I activity useful in
nucleic acid synthesis by primer extension reaction.

Background 

The archaebacteria are a recently discovered
group of microorganisms that grow optimally at
temperatures above 80°C. Some 20 species of these
extremely thermophilic bacteria-like organisms have
been isolated, mainly from shallow submarine and deep
sea geothermal environments. Most of the
archaebacteria are strict anaerobes and depend on the
reduction of elemental sulfur for growth.

The archaebacteria include a group of
"hyperthermophiles" that grow optimally around 100°C.
These are presently represented by three distinct
genera, Pyrodictium, Pyrococcus, and Pyrobaculum.
Pyrodictium brockii (Topt 105°C) is an obligate
autotroph which obtains energy by reducing S° to H2S
with H2, while Pyrobaculum islandicum (Topt 100°C) is a
facultative heterotroph that uses either organic
substrates or H2 to reduce S°. In contrast,
Pyrococcus furiosus (Topt 100°C) grows by a
fermentative-type metabolism rather than by S°
respiration. It is a strict heterotroph that utilizes
both simple and complex carbohydrates where only H2
and CO2 are the detectable products. The organism
reduces elemental sulfur to H2S apparently as a form
of detoxification since H2 inhibits growth.

The discovery of microorganisms growing optimally
around 100°C has generated considerable interest in
both academic and industrial communities. Both the
organisms and their enzymes have the potential to
bridge the gap between biochemical catalysis and many
industrial chemical conversions. However, knowledge
of the metabolism of the hyperthermophilic
microorganisms is presently very limited.

The polymerase chain reaction (PCR) is a powerful
method for the rapid and exponential amplification of
target nucleic acid sequences. PCR has facilitated
the development of gene characterization and molecular
cloning technologies including the direct sequencing
of PCR amplified DNA, the determination of allelic
variation, and the detection of infectious and genetic
disease disorders. PCR is performed by repeated
cycles of heat denaturation of a DNA template
containing the target sequence, annealing of opposing
primers to the complementary DNA strands, and
extension of the annealed primers with a DNA
polymerase. Multiple PCR cycles result in the
exponential amplification of the nucleotide sequence
delineated by the flanking amplification primers.
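
The exponential amplification described above can be illustrated with a short sketch. It assumes idealized behavior in which a fixed fraction of templates is copied in every cycle; the function name and the `efficiency` parameter are illustrative assumptions and do not come from the disclosure.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Idealized copy number after a given number of PCR cycles.

    efficiency is the assumed fraction of templates copied per cycle
    (1.0 means perfect doubling). Real reactions plateau, which this
    simple model ignores.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


if __name__ == "__main__":
    # Compare perfect doubling with a hypothetical 80% per-cycle efficiency.
    for n in (10, 20, 30):
        print(n, pcr_copies(1, n), pcr_copies(1, n, efficiency=0.8))
```

Under perfect doubling a single template yields 2^n copies after n cycles, which is why even a modest number of cycles produces a large amount of product.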

An important modification of the original PCR
technique was the substitution of Thermus aquaticus
(Taq) DNA polymerase in place of the Klenow fragment
of E. coli DNA pol I (Saiki, et al., Science,
230:1350-1354 (1988)). The incorporation of a
thermostable DNA polymerase into the PCR protocol
obviates the need for repeated enzyme additions and
permits elevated annealing and primer extension
temperatures which enhance the specificity of
primer:template associations. Taq polymerase thus
serves to increase the specificity and simplicity of
PCR.

Although Taq polymerase is used in the vast
majority of PCR performed today, it has a fundamental
drawback: purified Taq DNA polymerase enzyme is
devoid of 3' to 5' exonuclease activity and thus
cannot excise misinserted nucleotides (Tindall, et
al., Biochemistry, 29:5226-5231 (1990)). Several
independent studies suggest that 3' to 5'
exonuclease-dependent proofreading enhances the
fidelity of DNA synthesis. Reyland et al., J. Biol.
Chem., 263:6518-6524, 1988; Kunkel et al., J. Biol.
Chem., 261:13610-13616, 1986; Bernad et al., Cell,
58:219-228, 1989. Consistent with these findings, the
observed error rate (mutations per nucleotide per
cycle) of Taq polymerase is relatively high; estimates
range from 2 x 10^-4 during PCR (Saiki et al., Science,
239:487-491 (1988); Keohavong et al., Proc. Natl.
Acad. Sci. USA, 86:9253-9257 (1989)) to 2 x 10^-5 for
base substitution errors produced during a single
round of DNA synthesis of the lacZ gene (Eckert et
al., Nucl. Acids Res., 18:3739-3744 (1990)).

Polymerase-induced mutations incurred during PCR
increase arithmetically as a function of cycle number.
For example, if an average of two mutations occur
during one cycle of amplification, 20 mutations will
occur after 10 cycles and 40 will occur after 20
cycles. Each mutant and wild type template DNA
molecule will be amplified exponentially during PCR
and thus a large percentage of the resulting
amplification products will contain mutations.
Mutations introduced by Taq polymerase during DNA
amplification have hindered PCR applications which
require high fidelity DNA synthesis.
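
The arithmetic in the example above can be reproduced with a minimal sketch. The function name is illustrative; the only inputs are the average number of new mutations per cycle and the cycle count, as in the text.

```python
def cumulative_mutations(mutations_per_cycle: float, cycles: int) -> float:
    """New mutations accumulated after a number of cycles.

    Mutations are assumed to arise at a constant average rate per
    cycle, so the total grows linearly (arithmetically) with cycles.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    return mutations_per_cycle * cycles


if __name__ == "__main__":
    # The worked example from the text: two mutations per cycle.
    for n in (10, 20):
        print(n, cumulative_mutations(2, n))
```

The count of new mutation events grows linearly, while each mutant template, once formed, is itself copied exponentially in later cycles.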

Summary of the Invention 

A thermostable DNA polymerase from the
hyperthermophilic, marine archaebacterium, Pyrococcus
furiosus (Pfu) has been discovered. The monomeric,
multifunctional enzyme possesses both DNA polymerase
and 3' to 5' exonuclease activities. The polymerase
is extremely thermostable with a temperature optimum
near 75°C. The purified enzyme functions effectively
in the polymerase chain reaction (PCR). In addition,
results from PCR fidelity studies indicate that
Pyrococcus furiosus DNA polymerase yields
amplification products containing 12-fold fewer
mutations than reaction products from similar
amplifications performed with Taq DNA polymerase. The
3' to 5' exonuclease dependent proofreading activity
of Pfu DNA polymerase will excise mismatched 3'
terminal nucleotides from primer:template complexes
and correctly incorporate nucleotides complementary to
the template strand.

Unlike Taq DNA polymerase, Pfu DNA polymerase
does not possess 5' to 3' exonuclease activity. Pfu,
like Taq and Vent polymerases, does exhibit a
polymerase dependent 5' to 3' strand displacement
activity. Pfu DNA polymerase remains greater than 95%
active after one hour incubation at 95°C. In
contrast, Vent polymerase [New England Biolabs (NEB),
Beverly, MA] loses greater than 50% of its polymerase
activity after one hour incubation at 95°C. Pfu DNA
polymerase is thus unexpectedly superior to Taq and
Vent DNA polymerases in amplification protocols
requiring high fidelity DNA synthesis.

Thus, the present invention contemplates a
purified thermostable P. furiosus DNA polymerase I
(Pfu DNA Pol I or Pyro polymerase) having an amino
terminal amino acid residue sequence represented by
the formula shown in SEQ ID NO 1, having 775 amino
acid residues.

The apparent molecular weight of the native
protein is about 90,000-93,000 daltons as determined
by SDS-PAGE under non-denaturing (non-reducing)
conditions using Taq polymerase as a standard having a
molecular weight of 94,000 daltons. In preferred
embodiments, the Pyro polymerase is isolated from P.
furiosus, and more preferably has a specific 3' to 5'
exonuclease activity.

Brief Description of the Drawings 

In the drawings forming a portion of this
disclosure:

Figure 1 illustrates the molecular weight
determination of Pyro DNA polymerase compared to Taq
DNA polymerase by SDS-PAGE analysis (8-16% gradient
gel, under non-reducing conditions). High molecular
weight standards (Sigma Chem. Co., St. Louis, MO) are
electrophoresed in lanes 1 and 6. The molecular
weights of Phosphorylase B (97,200 daltons) and bovine
serum albumin (66,000 daltons) in lane 6 are indicated
by arrows. Sequencing grade Taq DNA polymerase which
has been modified (Promega Biotech, Madison, WI) has
an apparent molecular weight of 80,000 daltons (lane
2). Taq DNA polymerases from Cetus (Emeryville, CA)
and Stratagene (La Jolla, CA) are electrophoresed in
lanes 3 and 4, respectively, each with an apparent
molecular weight of 94,000 daltons. In lane 5, Pyro
DNA polymerase exhibits a molecular weight of
90,000-93,000 daltons.

Detailed Description of the Invention 
A. Definitions 

As used herein, "cell", "cell line", and "cell
culture" can be used interchangeably and all such
designations include progeny. Thus, the words
"transformants" or "transformed cells" include the
primary subject cell and cultures derived therefrom
without regard for the number of transfers. It is
also understood that all progeny may not be precisely
identical in DNA content, due to deliberate or
inadvertent mutations. Mutant progeny that have the
same functionality as screened for in the originally
transformed cell are included.

The term "control sequences" refers to DNA
sequences necessary for the expression of an operably
linked coding sequence in a particular host organism.
The control sequences that are suitable for
procaryotes, for example, include a promoter,
optionally an operator sequence, a ribosome binding
site, and the like. Eucaryotic cells are known to
utilize promoters, polyadenylation signals, and
enhancers.

The term "expression system" refers to DNA
sequences containing a desired coding sequence and
control sequences in operable linkage, so that hosts
transformed with these sequences are capable of
producing the encoded proteins. In order to effect
transformation, the expression system may be included
on a vector; however, the relevant DNA may then also
be integrated into the host chromosome.

The term "gene" as used herein refers to a DNA 
sequence that encodes a polypeptide. 

"Operably linked" refers to juxtaposition such
that the normal function of the components can be
performed. Thus, a coding sequence "operably linked"
to control sequences refers to a configuration wherein
the coding sequences can be expressed under the
direction of the control sequences.

The term "oligonucleotide" as used herein is
defined as a molecule comprised of two or more
deoxyribonucleotides and/or ribonucleotides,
preferably more than three. Its exact size will
depend on many factors, which in turn depend on the
ultimate function or use of the oligonucleotide. The
oligonucleotide may be derived synthetically or by
cloning.

WO 92/09689

PCT/US91/09026

The term "primer" as used herein refers to an
oligonucleotide, whether occurring naturally or
produced synthetically, which is capable of acting as
a point of initiation of nucleic acid synthesis when
placed under conditions in which synthesis of a primer
extension product which is complementary to a nucleic
acid strand is induced, i.e., in the presence of four
different nucleotide triphosphates and thermostable
enzyme in an appropriate buffer ("buffer" includes pH,
ionic strength, cofactors, etc.) and at a suitable
temperature. For Pyro polymerase, the buffer herein
preferably contains 1.5-2 mM of a magnesium salt,
preferably MgCl2, 150-200 µM of each nucleotide, and 1
µM of each primer, along with preferably 50 mM KCl, 20
mM Tris buffer, pH 8-8.4, and 100 µg/ml gelatin.
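
As a worked illustration of the preferred buffer, the sketch below computes how much of each stock solution would be needed for one reaction using the dilution relation C1 x V1 = C2 x V2. The final concentrations are taken from the ranges above; the stock concentrations, the 100 µl reaction volume, and all names in the code are hypothetical and chosen only for the example.

```python
def stock_volume_ul(final_conc: float, stock_conc: float, reaction_volume_ul: float) -> float:
    """Volume of stock (in microliters) giving final_conc in the reaction.

    Uses C1 * V1 = C2 * V2; both concentrations must share one unit.
    """
    if stock_conc <= 0 or final_conc < 0 or final_conc > stock_conc:
        raise ValueError("need 0 <= final_conc <= stock_conc and stock_conc > 0")
    return final_conc * reaction_volume_ul / stock_conc


# name: (final concentration from the text, hypothetical stock concentration)
COMPONENTS = {
    "MgCl2 (mM)": (2.0, 25.0),
    "each nucleotide (uM)": (200.0, 10000.0),
    "each primer (uM)": (1.0, 20.0),
    "KCl (mM)": (50.0, 1000.0),
    "Tris pH 8-8.4 (mM)": (20.0, 1000.0),
    "gelatin (ug/ml)": (100.0, 10000.0),
}


def reaction_recipe(reaction_volume_ul: float = 100.0) -> dict:
    """Stock volume per component for one reaction of the given size."""
    return {
        name: stock_volume_ul(final, stock, reaction_volume_ul)
        for name, (final, stock) in COMPONENTS.items()
    }


if __name__ == "__main__":
    for name, volume in reaction_recipe().items():
        print(f"{name}: {volume:.1f} ul")
```

The volumes for nucleotides and primers are per species, so a reaction with four nucleotides and two primers needs that volume of each.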

The primer is preferably single-stranded for
maximum efficiency in amplification, but may
alternatively be double-stranded. If double-stranded,
the primer is first treated to separate its strands
before being used to prepare extension products.
Preferably, the primer is an oligodeoxyribonucleotide.

The primer must be sufficiently long to prime the
synthesis of extension products in the presence of the
thermostable enzyme. The exact lengths of the primers
will depend on many factors, including temperature,
source of primer and use of the method. For example,
depending on the complexity of the target sequence,
the oligonucleotide primer typically contains 15-25
nucleotides, although it may contain more or fewer
nucleotides. Short primer molecules generally require
cooler temperatures to form sufficiently stable hybrid
complexes with template.
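
The link between primer length and hybrid stability can be illustrated with the Wallace rule, a common rule of thumb for short oligonucleotides that estimates the melting temperature as 2°C per A or T plus 4°C per G or C. This rule is not part of the disclosure and is used here only as an assumed approximation; the example sequences are hypothetical.

```python
def wallace_tm(primer: str) -> int:
    """Rule-of-thumb melting temperature (deg C) for a short DNA primer.

    Tm = 2 * (A + T) + 4 * (G + C). This ignores salt and primer
    concentration and is only a rough guide for short oligonucleotides.
    """
    seq = primer.upper()
    if not seq or any(base not in "ACGT" for base in seq):
        raise ValueError("primer must contain only A, C, G and T")
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    short_primer = "ATGCATGCATGCATG"             # hypothetical 15-mer
    long_primer = "ATGCATGCATGCATGCATGCATGCA"    # hypothetical 25-mer
    print(len(short_primer), wallace_tm(short_primer))
    print(len(long_primer), wallace_tm(long_primer))
```

Under this approximation every added base raises the estimate, which is consistent with shorter primers needing cooler annealing temperatures.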
The primers herein are selected to be
"substantially" complementary to the different strands
of each specific sequence to be amplified. This means
that the primers must be sufficiently complementary to
hybridize with their respective strands. Therefore,
the primer sequence need not reflect the exact
sequence of the template. For example, a
non-complementary nucleotide fragment may be attached to
the 5' end of the primer, with the remainder of the
primer sequence being complementary to the strand.
Alternatively, non-complementary bases or longer
sequences can be interspersed into the primer,
provided that the primer sequence has sufficient
complementarity with the sequence of the strand to be
amplified to hybridize therewith and thereby form a
template for synthesis of the extension product of the
other primer. However, for detection purposes,
particularly using labeled sequence-specific probes,
the primers typically have exact complementarity to
obtain the best results.

As used herein, the term "thermostable enzyme"
refers to an enzyme which is stable to heat and is
heat resistant and catalyzes (facilitates) combination
of the nucleotides in the proper manner to form the
primer extension products that are complementary to
each nucleic acid strand. Generally, the synthesis
will be initiated at the 3' end of each primer and
will proceed in the 5' direction along the template
strand, until synthesis terminates, producing
molecules of different lengths.

The thermostable enzyme herein must satisfy a
single criterion to be effective for the amplification
reaction, i.e., the enzyme must not become
irreversibly denatured (inactivated) when subjected to
the elevated temperatures for the time necessary to
effect denaturation of double-stranded nucleic acids.
Irreversible denaturation for purposes herein refers
to permanent and complete loss of enzymatic activity.
The heating conditions necessary for denaturation will
depend, e.g., on the buffer salt concentration and the
length and nucleotide composition of the nucleic acids
being denatured, but typically range from about 90 to
about 96°C for a time depending mainly on the
temperature and the nucleic acid length, typically
about 0.5 to four minutes. Higher temperatures may be
tolerated as the buffer salt concentration and/or GC
composition of the nucleic acid is increased.
Preferably, the enzyme will not become irreversibly
denatured at about 90-100°C.

The thermostable enzyme herein preferably has an
optimum temperature at which it functions that is
higher than about 40°C, which is the temperature below
which hybridization of primer to template is promoted,
although, depending on (1) magnesium and salt
concentrations and (2) composition and length of
primer, hybridization can occur at higher temperature
(e.g., 45-70°C). The higher the temperature optimum
for the enzyme, the greater the specificity and/or
selectivity of the primer-directed extension process.
However, enzymes that are active below 40°C, e.g., at
37°C, are also within the scope of this invention
provided they are heat-stable. Preferably, the
optimum temperature ranges from about 50° to 90°C,
more preferably 60-80°C.

Amino Acid Residue: The amino acid residues
described herein are preferred to be in the "L"
isomeric form. However, residues in the "D" isomeric
form can be substituted for any L-amino acid residue,
as long as the desired functional property is retained
by the polypeptide. NH2 refers to the free amino
group present at the amino terminus of a
polypeptide. COOH refers to the free carboxy group
present at the carboxy terminus of a polypeptide. The
amino-terminal NH2 group and carboxy-terminal COOH
group of free polypeptides are typically not set forth
in a formula. A hyphen at the amino- or carboxy-
terminus of a sequence indicates the presence of a
further sequence of amino acid residues or a
respective NH2 or COOH terminal group. In keeping
with standard polypeptide nomenclature, J. Biol.
Chem., 243:3552-59 (1969) and adopted at 37 CFR
§1.822(b)(2), abbreviations for amino acid residues
are shown in the following Table of Correspondence:

TABLE OF CORRESPONDENCE

SYMBOL                      AMINO ACID
1-Letter     3-Letter

Y            Tyr            tyrosine
G            Gly            glycine
F            Phe            phenylalanine
M            Met            methionine
A            Ala            alanine
S            Ser            serine
I            Ile            isoleucine
L            Leu            leucine
T            Thr            threonine
V            Val            valine
P            Pro            proline
K            Lys            lysine
H            His            histidine
Q            Gln            glutamine
E            Glu            glutamic acid
W            Trp            tryptophan
R            Arg            arginine
D            Asp            aspartic acid
N            Asn            asparagine
C            Cys            cysteine



It should be noted that all amino acid residue
sequences are represented herein by formulae whose
left and right orientation is in the conventional
direction of amino-terminus to carboxy-terminus.
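
The Table of Correspondence can be expressed as a simple lookup, shown here as a minimal sketch. The dictionary holds the twenty residues listed in the table; the function name and the hyphen-joined output format are illustrative choices, and the example sequence is hypothetical.

```python
# One-letter to three-letter symbols, as listed in the Table of Correspondence.
ONE_TO_THREE = {
    "Y": "Tyr", "G": "Gly", "F": "Phe", "M": "Met", "A": "Ala",
    "S": "Ser", "I": "Ile", "L": "Leu", "T": "Thr", "V": "Val",
    "P": "Pro", "K": "Lys", "H": "His", "Q": "Gln", "E": "Glu",
    "W": "Trp", "R": "Arg", "D": "Asp", "N": "Asn", "C": "Cys",
}


def to_three_letter(sequence: str) -> str:
    """Convert a one-letter residue sequence to three-letter symbols.

    The input is read left to right, amino-terminus to carboxy-terminus,
    and that orientation is preserved in the output.
    """
    try:
        return "-".join(ONE_TO_THREE[residue] for residue in sequence.upper())
    except KeyError as exc:
        raise ValueError(f"unknown residue symbol: {exc.args[0]}") from None


if __name__ == "__main__":
    print(to_three_letter("MIQ"))  # hypothetical three-residue sequence
```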

Nucleotide: a monomeric unit of DNA or RNA
consisting of a sugar moiety (pentose), a phosphate,
and a nitrogenous heterocyclic base. The base is
linked to the sugar moiety via the glycosidic carbon
(1' carbon of the pentose) and that combination of
base and sugar is a nucleoside. When the nucleoside
contains a phosphate group bonded to the 3' or 5'
position of the pentose it is referred to as a
nucleotide. A sequence of operatively linked
nucleotides is typically referred to herein as a "base
sequence" or "nucleotide sequence", and is represented
herein by a formula whose left to right orientation is
in the conventional direction of 5'-terminus to
3'-terminus.

Base Pair (bp): A partnership of adenine (A)
with thymine (T), or of cytosine (C) with guanine (G)
in a double stranded DNA molecule.

B. Pfu DNA Polymerase I (Pyro Polymerase)
Pyro polymerase, the thermostable DNA polymerase
of the present invention, can be obtained from any
source and can be a native or recombinant protein. A
preferred Pyro polymerase is isolated from Pyrococcus
furiosus. P. furiosus is available from Deutsche
Sammlung von Mikroorganismen (DSM), Grisebachstrasse
8, D-3400 Göttingen, FRG, under the accession number
DSM-6217.

For isolating the native protein from P. furiosus
cells, such cells are grown using any suitable
technique. A variety of such techniques have been
reported, those preferred being described by Fiala et
al., Arch. Microbiol., (1986) 145:56-61, and Bryant et
al., J. Biol. Chem., (1989) 264:5070-5079, the
disclosures of which are incorporated herein by
reference.

After cell growth, the isolation and purification
of Pyro polymerase takes place in about 3 stages, each
of which is performed at, and preferably below, room
temperature, preferably about 4°C.

In the first step, the cells are concentrated
from the growth medium, typically by centrifugation or
filtration.

In the second step, the cells are lysed and the
supernatant is segregated and recovered from the
cellular debris. Lysis is typically accomplished by
mechanically applying shear stress and/or enzymatic
digestion. Segregation of the supernatant is usually
accomplished by centrifugation.

The third step removes nucleic acids and some
protein. The supernatant from the second step is
applied to an agarose resin strong anionic exchange
column, such as Q-sepharose from Pharmacia
(Piscataway, NJ) equilibrated with column buffer [50
mM tris-hydroxymethylaminomethane (Tris), pH 8.2, 10
mM beta-mercaptoethanol, and 1 mM
ethylenediaminetetraacetic acid (EDTA)]. The
supernatant is washed through the column with the
column buffer and the pass-through and washes are
collected and centrifuged to remove any insoluble
material. The supernatant is segregated, usually
dialyzed, and then recovered to form a fraction
containing partially purified Pyro polymerase.

The fourth step removes substantially all (90%)
of the remaining contaminating proteins and comprises
applying the fraction recovered from step three to a
phosphocellulose column equilibrated with the before
described column buffer. The column is washed with
the column buffer until the optical density of the
wash eluate is at the buffer baseline at 280 nm. The
immobilized Pyro polymerase is thereafter eluted with
a linear salt gradient comprising 0 M to about 0.7 M
salt dissolved in the column buffer, the salt being
NaCl, KCl, and the like. Protein eluted from the
column at about 200 mM salt typically contains the
highest concentrations of assayable Pyro polymerase.
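
To show where an elution point falls along the linear gradient just described, the sketch below interpolates between the gradient endpoints. It assumes an ideal linear gradient running from 0 M to 0.7 M salt, as stated above; the function names are illustrative, and dead volume and mixing delays of a real column are ignored.

```python
def gradient_salt_molar(fraction: float, start_molar: float = 0.0, end_molar: float = 0.7) -> float:
    """Salt concentration (M) at a given fraction (0 to 1) of the gradient."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must be between 0 and 1")
    return start_molar + fraction * (end_molar - start_molar)


def gradient_fraction(target_molar: float, start_molar: float = 0.0, end_molar: float = 0.7) -> float:
    """Fraction of the gradient delivered when the target concentration is reached."""
    if end_molar == start_molar:
        raise ValueError("gradient endpoints must differ")
    fraction = (target_molar - start_molar) / (end_molar - start_molar)
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("target concentration lies outside the gradient")
    return fraction


if __name__ == "__main__":
    # About 200 mM salt is where the polymerase peak is reported above.
    position = gradient_fraction(0.2)
    print(f"peak expected about {position:.0%} of the way through the gradient")
```

On these assumptions the 200 mM peak is reached a little under one third of the way through the gradient, so fractions collected early in the run are the ones to assay first.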

In preferred embodiments, the Pyro polymerase
preparation obtained from the fourth step is further
purified in a fifth step by FPLC chromatography
through a high performance cation exchange column,
such as the Mono S column available from Pharmacia,
Piscataway, NJ, equilibrated with the before described
column buffer. After application, the column is
washed to remove non-bound contaminants. The
immobilized Pyro polymerase is then eluted with the
before-described linear salt gradient at about 120 mM
salt concentration. The Pyro polymerase eluate is
then typically dialyzed against the column buffer to
remove excess salt. A stabilizing agent, such as
glycerol, can be added to the preparation at this time
to facilitate low temperature storage. Typically, the
fraction is again dialyzed against a low salt buffer,
e.g., 50 mM Tris pH 7.5, 1 mM dithiothreitol, 0.1 mM
EDTA, 0.1% Tween 20, and 0.1% Nonidet P40.

In further preferred embodiments the Pyro
polymerase preparation of step five is applied to a
crosslinked agarose affinity column, such as the
Affi-Gel Blue column available from BioRad, Richmond, CA,
equilibrated with the before-described column buffer.
Non-bound protein is washed from the column and the
Pyro polymerase is eluted with the before-described
salt gradient with the Pyro polymerase typically being
recovered at about 280 mM salt concentration.
Thereafter, the Pyro polymerase preparation is usually
concentrated about 5-10 fold and dialyzed against
column buffer. Typically, a stabilizing agent, such
as glycerol, is added to the preparation to facilitate
low temperature storage.

The amino-terminal amino acid residue sequence of
Pyro polymerase can be determined by any suitable
method, such as by automated Edman degradation, and
the like. The amino acid residue sequence of a
preferred Pyro polymerase is shown in SEQ ID NO 1 from
residue 1 to 775.

The molecular weight of the dialyzed product may
be determined by any technique, for example, by sodium
dodecylsulfate-polyacrylamide gel electrophoresis
(SDS-PAGE) using protein molecular weight markers.
Native Pyro polymerase purified by the above method
has a relative molecular weight, determined by
SDS-PAGE under non-reducing conditions, of about
90,000-93,000 daltons.
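
One common way to read a molecular weight off an SDS-PAGE gel is to interpolate on a plot of log10(molecular weight) against relative mobility between two flanking markers. The sketch below assumes that approach; the marker weights are those given for Phosphorylase B and bovine serum albumin in the description of Figure 1, while the mobility values and the function name are hypothetical and serve only to make the example runnable.

```python
import math


def estimate_mw(rf: float, marker_a: tuple, marker_b: tuple) -> float:
    """Estimate molecular weight by log-linear interpolation.

    Each marker is (relative mobility, molecular weight in daltons).
    log10(MW) is assumed to vary linearly with relative mobility
    between the two markers.
    """
    (rf_a, mw_a), (rf_b, mw_b) = marker_a, marker_b
    if rf_a == rf_b:
        raise ValueError("markers must have different mobilities")
    slope = (math.log10(mw_b) - math.log10(mw_a)) / (rf_b - rf_a)
    return 10 ** (math.log10(mw_a) + slope * (rf - rf_a))


if __name__ == "__main__":
    phosphorylase_b = (0.30, 97_200)  # mobility 0.30 is a hypothetical value
    serum_albumin = (0.50, 66_000)    # mobility 0.50 is a hypothetical value
    sample_rf = 0.33                  # hypothetical mobility of the sample band
    print(round(estimate_mw(sample_rf, phosphorylase_b, serum_albumin)))
```

Because the relation is linear in the logarithm, a band halfway between the markers corresponds to their geometric mean rather than their arithmetic mean.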

In preferred embodiments, Pyro polymerase is used
in combination with a thermostable buffering agent
such as TAPS (N-tris[Hydroxymethyl]methyl-3-
aminopropanesulfonic acid; [2-Hydroxy-1,1-
bis(hydroxy-methyl)-ethyl]amino-1-propanesulfonic
acid), available from Sigma, St. Louis, MO (Catalog
P7905).

C. Recombinant Pyro Polymerase

Pyro polymerase can also be produced by
recombinant DNA (rDNA) techniques, as the gene
encoding the enzyme can be cloned from P. furiosus
genomic DNA. Thus, the present invention also
contemplates a DNA segment consisting essentially of a
nucleotide base sequence encoding a Pyro
polymerase of this invention. An exemplary DNA
sequence, obtained from the native gene, coding for a
preferred Pfu I protein is shown in SEQ ID NO 2 from
nucleotide base 224 to base 2548, which spans the
coding portion of SEQ ID NO 2.
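
The coordinates given for the coding portion can be checked against the stated protein length with a small calculation. The sketch assumes 1-based, inclusive base numbering, which is the usual convention for sequence listings but is an assumption here; the function name is illustrative.

```python
def coding_span(first_base: int, last_base: int) -> tuple:
    """Return (length in bases, whole codons, leftover bases).

    Coordinates are assumed to be 1-based and inclusive.
    """
    if last_base < first_base:
        raise ValueError("last_base must not precede first_base")
    length = last_base - first_base + 1
    codons, remainder = divmod(length, 3)
    return length, codons, remainder


if __name__ == "__main__":
    length, codons, remainder = coding_span(224, 2548)
    print(length, codons, remainder)
```

Bases 224 to 2548 span 2325 bases, which is exactly 775 codons with no remainder, matching the 775 amino acid residues stated for SEQ ID NO 1.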

The isolated gene can be operably linked to an
expression system to form an rDNA capable of
expressing, in a compatible host, Pyro polymerase.

Of course, modifications to the primary structure
itself by deletion, addition, or alteration of the
amino acids incorporated into the protein sequence
during translation can be made without destroying the
activity of the protein. Such substitutions or other
alterations result in proteins having an amino acid
sequence encoded by DNA falling within the
contemplated scope of the present invention.

1. Cloning and Expression of the Pyro
Polymerase Gene

Polyclonal antiserum from rabbits
immunized with the purified 90,000-93,000 dalton
polymerase of this invention can be used to probe a P.
furiosus partial genomic expression library to obtain
the appropriate coding sequence as described below.
The cloned genomic sequence can be expressed as a
fusion protein, expressed directly using its own
control sequences, or expressed by constructions using
control sequences appropriate to the particular host
used for expression of the enzyme.

Thus, the complete coding sequence for Pyro
polymerase is available, from which expression vectors
applicable to a variety of host systems can be
constructed and the coding sequence expressed. It is
also evident from the foregoing that portions of the
Pyro polymerase-encoding sequence are useful as probes
to retrieve other similar thermostable
polymerase-encoding sequences in a variety of
Archaebacteria species, particularly from other
Pyrococcus species and P. furiosus strains.
Accordingly, portions of the genomic DNA encoding at
least six contiguous amino acids can be synthesized and
used as probes to retrieve additional DNAs encoding an
Archaebacteria thermostable polymerase. Because there
may not be a precisely exact match between the
nucleotide sequence in the P. furiosus form described
herein and that in the corresponding portion of other
species or strains, oligomers containing approximately
18 nucleotides (encoding the six amino acid stretch)
are probably necessary to obtain hybridization under
conditions of sufficient stringency to eliminate false
positives. The sequences encoding six amino acids
would supply information sufficient for such probes.
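
The relation between a six amino acid stretch and an 18 nucleotide probe, and the reason such probes are often made degenerate, can be sketched as follows. The codon counts are those of the standard genetic code; the function names and the example peptide are hypothetical and are not taken from the disclosure.

```python
# Number of codons per amino acid in the standard genetic code.
CODON_COUNTS = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def probe_length(residues: int) -> int:
    """Nucleotides needed to encode a stretch of amino acid residues."""
    return 3 * residues


def degeneracy(peptide: str) -> int:
    """Number of distinct nucleotide sequences that can encode the peptide."""
    total = 1
    for residue in peptide.upper():
        if residue not in CODON_COUNTS:
            raise ValueError(f"unknown residue symbol: {residue}")
        total *= CODON_COUNTS[residue]
    return total


if __name__ == "__main__":
    peptide = "DYKFMW"  # hypothetical six-residue stretch
    print(probe_length(len(peptide)), degeneracy(peptide))
```

Stretches rich in residues with few codons, such as methionine and tryptophan, keep the number of probe variants small, which is one reason they are attractive when choosing a region to probe.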

Exemplary degenerate Pfu DNA polymerase I gene
probes are shown in Table 1.

Table 1

Degenerate Pfu DNA Polymerase I Gene Probes

   A
   T T T I
1) 5'-GACTACATCACCGAIGAIGG-3'

   T T A A
2) 5'-CCCTCCTCIGTIATGTAGTC-3'

   T A A T
3) 5'-TTCAAGAAGAACGG-3'

   A T T A
4) 5'-CCGTTCTTCTTGAA-3'

Nucleotide sequence degenerate substitutions are
bases directly above the positions in the sequence
where the substitutions were made. In some cases,
degeneracy was accomplished by substituting
inosine (I) in the sequence.
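
Probes 3 and 4 of Table 1, as printed, are reverse complements of one another, so they correspond to the two strands of the same site. The sketch below checks this with a plain reverse-complement function; the function name is illustrative, and inosine-containing sequences such as probes 1 and 2 are deliberately not handled.

```python
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def reverse_complement(sequence: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'.

    Only A, C, G and T are accepted; inosine (I) and other symbols
    raise an error rather than being guessed at.
    """
    try:
        return "".join(COMPLEMENT[base] for base in reversed(sequence.upper()))
    except KeyError as exc:
        raise ValueError(f"unsupported base: {exc.args[0]}") from None


if __name__ == "__main__":
    probe_3 = "TTCAAGAAGAACGG"
    probe_4 = "CCGTTCTTCTTGAA"
    print(reverse_complement(probe_3) == probe_4)
```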

A preferred and exemplary cloning protocol for 
isolation of a pfu pol I gene is described in the 
Examples. From the clone pF72 described in the 
Examples, the nucleotide sequence of a preferred gene 
encoding pfu pol I was described and is shown in SEQ 
ID NO 2, and can be utilized for the production of 
recombinant Pyro polymerase. 

In general terms, the production of a recombinant 
form of Pyro polymerase typically involves the 
following: 

First, a DNA is obtained that encodes the mature
(used here to include all muteins) enzyme or a fusion
of the Pyro polymerase either to an additional
sequence that does not destroy its activity, or to an
additional sequence cleavable under controlled
conditions (such as treatment with peptidase) to give
an active protein. If the sequence is uninterrupted
by introns it is suitable for expression in any host.
This sequence should be in an excisable and
recoverable form.

The excised or recovered coding sequence is then
preferably placed in operable linkage with suitable
control sequences in a replicable expression vector.
The vector is used to transform a suitable host and
the transformed host cultured under favorable
conditions to effect the production of the recombinant
Pyro polymerase. Optionally the Pyro polymerase is
isolated from the medium or from the cells; recovery
and purification of the protein may not be necessary
in some instances, where some impurities may be
tolerated.

Each of the foregoing steps can be done in a
variety of ways. For example, the desired coding
sequences may be obtained from genomic fragments and
used directly in appropriate hosts. The constructions
for expression vectors operable in a variety of hosts
are made using appropriate replicons and control
sequences, as set forth below. Suitable restriction
sites can, if not normally available, be added to the
ends of the coding sequence so as to provide an
excisable gene to insert into these vectors.

The control sequences, expression vectors, and
transformation methods are dependent on the type of
host cell used to express the gene. Generally,
procaryotic, yeast, insect or mammalian cells are
presently useful as hosts. Procaryotic hosts are in
general the most efficient and convenient for the
production of recombinant proteins and therefore are
preferred for the expression of Pyro polymerase.

2. Control Sequences and Corresponding
Hosts

Procaryotes most frequently are
represented by various strains of E. coli. However,
other microbial strains may also be used, such as
bacilli, for example, Bacillus subtilis, various
species of Pseudomonas, or other bacterial strains.
In such procaryotic systems, plasmid vectors that
contain replication sites and control sequences
derived from species compatible with the host are
used. For example, E. coli is typically transformed
using derivatives of pBR322, a plasmid derived from an
E. coli species by Bolivar, et al., Gene, (1977) 2:95
and Sutcliffe, Nuc. Acids Res., (1978) 5:2721-28.
pBR322 contains genes for ampicillin and tetracycline
resistance, and thus provides additional markers that
can be either retained or destroyed in constructing
the desired vector. Commonly used procaryotic control
sequences, which are defined herein to include
promoters for transcription initiation, optionally
with an operator, along with ribosome binding site
sequences, include such commonly used promoters as the
β-lactamase (penicillinase) and lactose (lac) promoter
systems (Chang, et al., Nature, (1977) 198:1056), the
tryptophan (trp) promoter system (Goeddel, et al.,
Nucleic Acids Res., (1980) 8:4057) and the
lambda-derived promoter (Shimatake, et al., Nature, (1981)
292:128) and N-gene ribosome binding site, which has
been made useful as a portable control cassette (as
set forth in U.S. Patent No. 4,711,845), which
comprises a first DNA sequence that is the promoter
operably linked upstream of a third DNA sequence
having at least one restriction site that permits
cleavage within six bp 3' of the NRBS sequence. Also
useful is the phosphatase A (phoA) system described by
Chang, et al. in European Patent Publication No.
196,864. However, any available promoter system
compatible with procaryotes can be used. Typical
bacterial plasmids are pUC8, pUC9, pBR322 and pBR329
available from Bio-Rad Laboratories (Richmond, CA)
and pPL and pKK233-2, available from Pharmacia
(Piscataway, NJ) or Clontech (Palo Alto, CA).

In addition to bacteria, eucaryotic microbes,
such as yeast, may also be used as hosts. Laboratory
strains of Saccharomyces cerevisiae, Baker's yeast,
are most used, although a number of other strains are
commonly available. While vectors employing the 2
micron origin of replication are illustrated (Broach,
J.R., Meth. Enz., (1983) 101:307), other plasmid
vectors suitable for yeast expression are known (see,
for example, Stinchcomb, et al., Nature, (1979)
282:39; Tschumper, et al., Gene, (1980) 10:157; Clarke,
L., et al., Meth. Enz., (1983) 101:300; Brake et al.,
Proc. Natl. Acad. Sci. USA, (1984) 81:4642-4647; and
Hallewell et al., Biotechnology, (1987) 5:363-366).
Control sequences for yeast vectors include promoters
for the synthesis of glycolytic enzymes (Hess, et al.,
J. Adv. Enzyme Reg., (1968) 7:149; Holland, et al.,
Biotechnology (1978) 17:4900).
Additional promoters known in the art include the
promoter for 3-phosphoglycerate kinase (Hitzeman, et
al., J. Biol. Chem., (1980) 255:2073) and those for
other glycolytic enzymes, such as glyceraldehyde-3-
phosphate dehydrogenase, hexokinase, pyruvate
decarboxylase, phosphofructokinase, glucose-6-
phosphate isomerase, 3-phosphoglycerate mutase,
pyruvate kinase, triosephosphate isomerase,
phosphoglucose isomerase, and glucokinase. Other
promoters that have the additional advantage of
transcription controlled by growth conditions are the
promoter regions for alcohol dehydrogenase 2,
isocytochrome C, acid phosphatase, degradative enzymes
associated with nitrogen metabolism, and enzymes
responsible for maltose and galactose utilization
(Holland, supra).

It is also believed that terminator sequences are
desirable at the 3' end of the coding sequences. Such
terminators are found in the 3' untranslated region
following the coding sequences in yeast-derived genes.
Many of the vectors illustrated contain control
sequences derived from the enolase gene containing
plasmid peno46 (Holland, M. M., et al., J. Biol. Chem.,
(1981) 256:1385) or the LEU2 gene obtained from YEp13
(Broach, J., et al., Gene, (1978) 8:21); however, any
vector containing a yeast-compatible promoter, origin
of replication, and other control sequences is
suitable.

It is also, of course, possible to express genes
encoding polypeptides in eucaryotic host cell cultures
derived from multicellular organisms. See, for
example, Tissue Culture, Academic Press, Cruz and
Patterson, editors (1973). Useful host cell lines
include murine myelomas N51, VERO and HeLa cells, and
Chinese hamster ovary (CHO) cells available from the
ATCC as CCL61, and NIH/3T3 mouse cells available from
the ATCC as CRL1658. Expression vectors for such
cells ordinarily include promoters and control
sequences compatible with mammalian cells such as, for
example, the commonly used early and late promoters
from Simian Virus 40 (SV 40) (Fiers, et al., Nature,
(1978) 273:113), or other viral promoters such as
those derived from polyoma, Adenovirus 2, bovine
papilloma virus, or avian sarcoma viruses, or
immunoglobulin promoters and heat shock promoters. A
system for expressing DNA in mammalian systems using
the BPV as a vector is disclosed in U.S. Patent No.
4,419,446. A modification of this system is described
in U.S. Patent No. 4,601,978. General aspects of
mammalian cell host system transformations have been
described in U.S. Patent No. 4,399,216. It now
appears, also, that "enhancer" regions are important
in optimizing expression; these are, generally,
sequences found upstream of the promoter region.
Origins of replication may be obtained, if needed,
from viral sources. However, integration into the
chromosome is a common mechanism for DNA replication
in eucaryotes.

Plant cells are also now available as hosts, and
control sequences compatible with plant cells such as
the nopaline synthase promoter and polyadenylation
signal sequences (Depicker, A., et al., J. Mol. Appl.
Gen., (1982) 1:561) are available. See, also, U.S.
Patents No. 4,962,028, No. 4,956,282, No. 4,886,753
and No. 4,801,540.

Recently, in addition, expression systems
employing insect cells utilizing the control systems
provided by baculovirus vectors have been described
(Miller, D.W., et al., in Genetic Engineering (1986)
Setlow, J.K. et al., eds., Plenum Publishing, Vol. 8,
pp. 277-297). See, also, U.S. Patents No. 4,745,051
and No. 4,879,236. These systems are also successful
in producing Pyro polymerase.

A preferred DNA segment containing both the Pfu
pol I coding portion and control sequences at the 5'
and 3' termini of the coding portion is shown in SEQ
ID NO 2 from nucleotide base 1 to base 3499.

3. Transformations

The recombinant DNA molecules of the
present invention are introduced into host cells via
a procedure commonly known as transformation or
transfection. Transformation of appropriate host
cells with a recombinant DNA molecule of the present
invention is accomplished by well known methods that
typically depend on the type of vector used. With
regard to transformation of procaryotic host cells or
other cells that contain substantial cell wall
barriers, see, for example, Cohen et al., Proc. Natl.
Acad. Sci. USA, 69:2110 (1972); and Maniatis et al.,
Molecular Cloning, A Laboratory Manual, Cold Spring
Harbor Laboratory, Cold Spring Harbor, NY (1982).
With regard to transformation of vertebrate cells with
retroviral vectors containing rDNA, see, for example,
Sorge et al., Mol. Cell. Biol., 4:1730-37 (1984); and
Wigler et al., Proc. Natl. Acad. Sci. USA, 76:1373-76
(1979).

Infection with Agrobacterium tumefaciens (Shaw,
C.H., et al., Gene, (1983) 23:315) is used for certain
plant cells. For mammalian cells without cell walls,
the calcium phosphate precipitation method of Graham
and van der Eb, Virology (1978) 52:546 is preferred.
Transformations into yeast are carried out according
to the method of Van Solingen, P., et al., J. Bact.
(1977) 130:946 and Hsiao, C.L., et al., Proc. Natl.
Acad. Sci. (USA), (1979) 76:3829.

Successfully transformed cells, i.e., cells that
contain a recombinant DNA (rDNA) molecule of the
present invention, are usually monitored by an
appropriate immunological, hybridization or functional
assay. For example, cells resulting from the
introduction of an rDNA of the present invention can
be cloned to produce monoclonal colonies. Cells from
those colonies can be harvested, lysed and their DNA
content examined for the presence of the rDNA using a
method such as that described by Southern, J. Mol.
Biol., 98:503 (1975) or Berent et al., Biotech., 3:208
(1985).

In addition to directly assaying for the presence
of rDNA, successful transformation can be confirmed by
well known immunological methods when the rDNA is
capable of directing the expression of Pyro
polymerase. For example, cells successfully
transformed with a subject rDNA containing an
expression vector produce a polypeptide displaying a
characteristic antigenicity. Samples of a culture
containing cells suspected of being transformed are
harvested and assayed for a subject polypeptide (Pyro
polymerase) using antibodies specific for that
polypeptide antigen, such as those produced by an
appropriate hybridoma.

A particularly convenient assay technique
involves fusing the Pyro polymerase-encoding DNA to a
Lac Z gene in a suitable plasmid, e.g. pLG. Since the
plasmid lacks a promoter and Shine-Dalgarno sequence,
no β-galactosidase is synthesized. However, when a
portable promoter fragment is properly positioned in
front of the fused gene, high levels of a fusion
protein having β-galactosidase activity should be
expressed. The plasmids are used to transform Lac-
bacteria, which are scored for β-galactosidase activity
on lactose indicator plates. Plasmids having
optimally placed promoter fragments are thereby
recognized. These plasmids can then be used to
reconstitute the fusion protein gene, which is
expressed at high levels.

Thus, in addition to the transformed host cells
themselves, cultures of the cells are contemplated as
within the present invention. The cultures include
monoclonal (clonally homogeneous) cultures, or
cultures derived from a monoclonal culture, in a
nutrient medium. Nutrient media useful for culturing
transformed host cells are well known in the art and
can be obtained from several commercial sources. In
embodiments wherein the host cell is mammalian, a
"serum-free" medium is preferably used.

The present method entails culturing a nutrient
medium containing host cells transformed with a
recombinant DNA molecule of the present invention that
is capable of expressing a gene encoding a subject
polypeptide. The culture is maintained for a time
period sufficient for the transformed cells to express
the subject polypeptide. The expressed polypeptide is
then recovered from the culture.

Once a gene has been expressed at high levels, a
DNA fragment containing the entire expression
assembly (e.g., promoter, ribosome-binding site, and
fusion protein gene) may be transferred to a plasmid
that can attain very high copy numbers. For instance,
the temperature-inducible "runaway replication" vector
pKN402 may be used. Preferably, the plasmid selected
will have additional cloning sites which allow one to
score for insertion of the gene assembly. See
Bittner et al., Gene, 15:31 (1981). Bacterial cultures
transformed with the plasmids are grown for a few
hours to increase plasmid copy number, e.g., to more
than 1000 copies per cell. Induction may be performed
in some cases by elevated temperature and in other
cases by addition of an inactivating agent to a
repressor. Potentially very large increases in cloned
fusion proteins can be obtained in this way.

4. Construction of a Lambda Expression Library

The strategy for isolating DNA encoding
desired proteins, such as the Pyro polymerase encoding
DNA, using the bacteriophage vector lambda gt11, is as
follows. A library can be constructed of EcoRI-
flanked AluI fragments, generated by complete
digestion of P. furiosus DNA, inserted at the EcoRI
site in the lambda gt11 phage (Young and Davis, Proc.
Natl. Acad. Sci. (USA), (1983) 80:1194-1198). Because
the unique EcoRI site in this bacteriophage is located
in the carboxyl-terminus of the β-galactosidase gene,
inserted DNA (in the appropriate frame and
orientation) is expressed as protein fused with
β-galactosidase under the control of the lactose operon
promoter/operator.

Genomic expression libraries are then screened
using the antibody plaque hybridization procedure. A
modification of this procedure, referred to as
"epitope selection," uses antiserum against the fusion
protein sequence encoded by the phage to confirm the
identification of hybridized plaques. Thus, this
library of recombinant phages could be screened with
antibodies that recognize the 90,000-93,000 dalton
Pyro polymerase in order to identify phage that carry
DNA segments encoding the antigenic determinants of
the Pyro polymerase protein.

Approximately 2 X 10^ recombinant phage are 
screened using rabbit Pyro polymerase antiserum. In 
this primary screen, positive signals are detected and
one or more of these plaques are purified from
candidate plaques which failed to react with preimmune
serum and reacted with immune serum, and analyzed in
some detail. Anti-Pyro polymerase antibodies can be
prepared by a number of known methods; see, for
example, U.S. Patent No. 4,082,735, No. 4,082,736, and
No. 4,493,795.

To examine the fusion proteins produced by the
recombinant phage, lysogens of the phage in the host
Y1089 are produced. Upon induction of the lysogens
and gel electrophoresis of the resulting proteins,
each lysogen may be observed to produce a new protein,
not found in the other lysogens, or duplicate
sequences may result. Phage containing positive
signals are picked. Typically, one positive plaque is
picked for further identification and replated at
lower densities to purify recombinants, and the
purified clones are analyzed by size class via
digestion with EcoRI restriction enzyme. Probes can
then be made of the isolated DNA insert sequences and
labeled appropriately, and these probes can be used in
conventional colony or plaque hybridization assays
described in Maniatis et al., Molecular Cloning: A
Laboratory Manual, (1982), the disclosure of which is
incorporated herein by reference.

5. Recombinant DNA Molecules

The present invention further
contemplates a recombinant DNA (rDNA) that includes a
Pyro polymerase-encoding DNA segment of the present
invention operatively linked to a vector for
replication and/or expression. Preferred rDNA
molecules contain less than 50,000 nucleotide base
pairs, usually less than 20,000 base pairs and
preferably less than about 10,000 base pairs.
Preferably, a Pyro polymerase-encoding DNA of this
invention is in the form of a plasmid, cosmid or
phage.

A preferred rDNA molecule includes a nucleotide 
sequence shown in SEQ ID NO 2 from nucleotide base 224 
to base 2548. 

A rDNA molecule of the present invention can be 
produced by operatively linking a vector to a DNA 
segment of the present invention. 

WO 92/09689    PCT/US91/09026

As used herein, the term "vector" refers to a
nucleic acid molecule capable of transporting between
different genetic environments another nucleic acid to
which it has been operatively linked. Preferred
vectors are those capable of autonomous replication
and/or expression of nucleic acids to which they are
operatively linked, and are referred to herein as
"expression vectors". As used herein, the term
"operatively linked", in reference to DNA segments,
describes that the nucleotide sequence is joined to
the vector so that the sequence is under the
transcriptional and translational control of the
expression vector and can be expressed in a suitable
host cell.

As is well known in the art, the choice of vector
to which a protein encoding DNA segment of the present
invention is operatively linked depends upon the
functional properties desired, e.g., protein
expression, and upon the host cell to be transformed.
These limitations are inherent in the art of
constructing recombinant DNA molecules. However, a
vector contemplated by the present invention is at
least capable of directing the replication, and
preferably also expression, of a gene operatively
linked to the vector.

In preferred embodiments, a vector contemplated
by the present invention includes a procaryotic
replicon, i.e., a DNA sequence having the ability to
direct autonomous replication and maintenance of the
recombinant DNA molecule extrachromosomally in a
procaryotic host cell, such as a bacterial host cell,
transformed therewith. Such replicons are well known
in the art. In addition, those embodiments that
include a procaryotic replicon may also include a gene
whose expression confers a selective advantage, such as
amino acid nutrient dependency or drug resistance, to a
bacterial host transformed therewith, as is well known,
in order to allow selection of transformed clones.
Typical bacterial drug resistance genes are those that
confer resistance to ampicillin, tetracycline, or
kanamycin.

Those vectors that include a procaryotic replicon
may also include a procaryotic promoter capable of
directing the expression (transcription and
translation) of the gene transformed therewith. A
promoter is an expression control element formed by a
DNA sequence that permits binding of RNA polymerase
and transcription to occur. Promoter sequences
compatible with bacterial hosts are typically provided
in plasmid vectors containing convenient restriction
sites for insertion of a DNA segment of the present
invention. Bacterial expression systems, and the choice
and use of vectors in those systems, are described in
detail in "Gene Expression Technology", Meth.
Enzymol., Vol. 185, Goeddel, Ed., Academic Press, NY
(1990).

Expression vectors compatible with eucaryotic
cells, preferably those compatible with vertebrate
cells, can also be used to form the recombinant DNA
molecules of the present invention. Eucaryotic cell
expression vectors are well known in the art and are
available from several commercial sources. Typically,
such vectors are provided containing convenient
restriction sites for insertion of the desired gene.
Typical of such vectors are pSVL and pKSV-10
(Pharmacia), pBPV-1/pML2d (International
Biotechnologies, Inc.), and pTDT1 (ATCC, #31255).

In preferred embodiments, the eucaryotic cell
expression vectors used to construct the recombinant
DNA molecules of the present invention include a
selectable phenotypic marker that is effective in a
eucaryotic cell, such as a drug resistance selection
marker or selective marker based on nutrient
dependency. A preferred drug resistance marker is the
gene whose expression results in neomycin resistance,
i.e., the neomycin phosphotransferase (neo) gene.
Southern et al., J. Mol. Appl. Genet., 1:327-341
(1982).

The use of retroviral expression vectors to form
the rDNAs of the present invention is also
contemplated. As used herein, the term "retroviral
expression vector" refers to a DNA molecule that
includes a promoter sequence derived from the long
terminal repeat (LTR) region of a retrovirus genome.

In addition to using strong promoter sequences to
generate large quantities of mRNA coding for the
expressed fusion proteins of the present invention, it
is desirable to provide ribosome-binding sites in the
mRNA to ensure efficient translation. The ribosome-
binding site in E. coli includes an initiation codon
(AUG) and a sequence 3-9 nucleotides long located 3-11
nucleotides upstream from the initiation codon (the
Shine-Dalgarno sequence). See Shine et al., Nature,
254:34 (1975). Methods for including a ribosome-
binding site in mRNAs corresponding to the expressed
proteins are described by Maniatis, et al., Molecular
Cloning: A Laboratory Manual, Cold Spring Harbor
Laboratory Press, NY, pp. 412-417 (1982). Ribosome
binding sites can be modified to produce optimum
configuration relative to the structural gene for
maximal expression of the structural gene. Hallewell
et al., Nucl. Acid. Res., (1985) 13:2017-2034.
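
The length and spacing rule stated above for the E. coli
ribosome-binding site can be sketched as a simple sequence
scan. This is an illustration only, not a method taken from
the specification: the consensus core AGGAGG, the function
name and the definition of the gap (nucleotides between the
end of the matched stretch and the A of AUG) are assumptions
introduced here.

```python
# Assumed Shine-Dalgarno consensus core; the text gives only the
# length (3-9 nt) and spacing (3-11 nt) ranges, not a sequence.
SD_CORE = "AGGAGG"


def find_rbs_candidates(mrna, min_len=3, max_len=9, min_gap=3, max_gap=11):
    """Return (sd_start, sd_seq, gap, aug_index) tuples.

    A candidate is a stretch of min_len..max_len nucleotides that is a
    substring of SD_CORE and ends `gap` nucleotides before an AUG codon.
    """
    rna = mrna.upper().replace("T", "U")
    hits = []
    for aug in range(len(rna) - 2):
        if rna[aug:aug + 3] != "AUG":
            continue
        for gap in range(min_gap, max_gap + 1):
            sd_end = aug - gap
            for length in range(min_len, max_len + 1):
                sd_start = sd_end - length
                if sd_start < 0:
                    break  # ran off the 5' end of the message
                candidate = rna[sd_start:sd_end]
                if candidate in SD_CORE:
                    hits.append((sd_start, candidate, gap, aug))
    return hits
```

Every AUG is treated as a possible initiation codon, so the
result is a list of candidates to inspect rather than a
prediction of translation efficiency.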

Construction of suitable vectors containing the
desired coding and control sequences employs standard
ligation and restriction techniques that are well
understood in the art. Isolated plasmids, DNA
sequences, or synthesized oligonucleotides are
cleaved, tailored, and religated in the form desired.

Site-specific DNA cleavage is performed by
treating with the suitable restriction enzyme (or
enzymes) under conditions that are generally
understood in the art, and the particulars of which
are specified by the manufacturer of these
commercially available restriction enzymes. See, e.g.,
New England Biolabs, Product Catalog. In general,
complete digestion is obtained by admixing about 1 µg
of plasmid or DNA sequence with one unit of enzyme in
about 20 µl of buffer solution. Incubation times of
about one hour to two hours at about 37 °C are
workable, although variations can be tolerated. After
each incubation, protein is removed by extraction with
phenol/chloroform, which may be followed by ether
extraction, and the nucleic acid is recovered from
aqueous fractions by precipitation with ethanol. If
desired, size separation of the cleaved fragments may
be performed; a general description of size
separations is found in Methods in Enzymology (1980)
65:499-560.

Restriction-cleaved fragments may be blunt-ended
by treating with large fragment E. coli DNA polymerase
I (Klenow) in the presence of the four deoxynucleotide
triphosphates (dNTPs) using incubation times of about
15 to 25 minutes at 20 to 25 °C in 50 mM Tris pH 7.6,
50 mM NaCl, 10 mM MgCl2, 10 mM DTT and 50-100 µM
dNTPs. The Klenow fragment fills in at 5' sticky
ends, but chews back protruding 3' single strands even
though the four dNTPs are present. If desired,
selective repair can be performed by supplying only
one of the, or selected, dNTPs within the limitations
dictated by the nature of the sticky ends. After
treatment with Klenow, the mixture is extracted with
phenol/chloroform and ethanol precipitated. Treatment
under appropriate conditions with S1 nuclease results
in hydrolysis of any single-stranded portion.

Synthetic oligonucleotides may be prepared using
the triester method of Matteucci, et al., (J. Am.
Chem. Soc., (1981) 103:3185-3191) or using automated
synthesis methods. Kinasing of single strands prior
to annealing or for labeling is achieved using an
excess, e.g., approximately 10 units of polynucleotide
kinase to 1 nM substrate in the presence of 50 mM
Tris, pH 7.6, 10 mM MgCl2, 5 mM dithiothreitol, 1-2 mM
ATP. If kinasing is for labeling of probe, the ATP
will contain high specific activity 32P.

Ligations are performed in 15-30 µl volumes under
the following standard conditions and temperatures: 20
mM Tris-Cl pH 7.5, 10 mM MgCl2, 10 mM DTT, 33 µg/ml
BSA, 10 mM-50 mM NaCl, and either 40 µM ATP, 0.01-0.02
(Weiss) units T4 DNA ligase at 0 °C (for "sticky end"
ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA
ligase at 14 °C (for "blunt end" ligation).
Intermolecular "sticky end" ligations are usually
performed at 33-100 µg/ml total DNA concentrations (5-
100 nM total end concentration). Intermolecular blunt
end ligations (usually employing a 10-30 fold molar
excess of linkers) are performed at 1 µM total ends
concentration.
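
The relationship between a DNA mass concentration and the
corresponding molar concentration of ends can be sketched as
follows. The figure of 650 g/mol per base pair is a commonly
used average and is an assumption here, as are the function
name and the 4000 bp fragment length used in the check; none
of them comes from the text.

```python
AVG_BP_MASS = 650.0  # g/mol per base pair; assumed average, not from the text


def end_concentration_nM(dna_ug_per_ml, fragment_bp, ends_per_molecule=2):
    """Convert a DNA concentration in ug/ml to nM of fragment ends."""
    grams_per_liter = dna_ug_per_ml * 1e-3  # ug/ml equals mg/l
    molar = grams_per_liter / (fragment_bp * AVG_BP_MASS)
    return molar * ends_per_molecule * 1e9
```

For a hypothetical 4000 bp linear fragment, 33 µg/ml and
100 µg/ml correspond to roughly 25 nM and 77 nM ends under
these assumptions, which falls inside the 5-100 nM range
quoted above.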

In vector construction employing "vector
fragments", the vector fragment is commonly treated
with bacterial alkaline phosphatase (BAP) in order to
remove the 5' phosphate and prevent religation of the
vector. BAP digestions are conducted at pH 8 in
approximately 150 mM Tris, in the presence of Na+ and
Mg2+ using about 1 unit of BAP per mg of vector at
60 °C for about one hour. In order to recover the
nucleic acid fragments, the preparation is extracted
with phenol/chloroform and ethanol precipitated.
Alternatively, religation can be prevented in vectors
that have been double digested by additional
restriction enzyme digestion of the unwanted
fragments.

For portions of vectors derived from cDNA or
genomic DNA that require sequence modifications, site-
specific primer-directed mutagenesis is used. This
technique is now standard in the art, and is conducted
using a primer synthetic oligonucleotide complementary
to a single-stranded phage DNA to be mutagenized
except for limited mismatching, representing the
desired mutation. Briefly, the synthetic
oligonucleotide is used as a primer to direct
synthesis of a strand complementary to the phage, and
the resulting double-stranded DNA is transformed into
a phage-supporting host bacterium. Cultures of the
transformed bacteria are plated in top agar,
permitting plaque formation from single cells that
harbor the phage.

Theoretically, 50% of the new plaques will
contain the phage having, as a single strand, the
mutated form; 50% will have the original sequence.
The plaques are transferred to nitrocellulose filters
and the "lifts" hybridized with kinased synthetic
primer at a temperature that permits hybridization of
an exact match, but at which the mismatches with the
original strand are sufficient to prevent
hybridization. Plaques that hybridize with the probe
are then picked and cultured, and the DNA is
recovered.

6. Verification of Construction

Correct ligations for plasmid
construction are confirmed by first transforming E.
coli strain MM294, or other suitable host, with the
ligation mixture. Successful transformants are
selected by ampicillin, tetracycline or other
antibiotic resistance or using other markers,
depending on the mode of plasmid construction, as is
understood in the art. Plasmids from the
transformants are then prepared according to the
method of Clewell, D.B., et al., Proc. Natl. Acad.
Sci. (USA), (1969) 62:1159, optionally following
chloramphenicol amplification (Clewell, D.B., J.
Bacteriol., (1972) 110:667). The isolated DNA is
analyzed by restriction digest mapping and/or
sequenced by the dideoxy method of Sanger, F., et al.,
Proc. Natl. Acad. Sci. (USA), (1977) 74:5463 as
further described by Messing, et al., Nucleic Acids
Res., (1981) 9:309, or by the method of Maxam, et al.,
Methods in Enzymology, (1980) 65:499.

Host strains useful in cloning and expression are
as follows:

For cloning and sequencing, and for expression of
constructions under control of most bacterial
promoters, E. coli strain MM294, obtained from E. coli
Genetic Stock Center GCSC #6135, is particularly
useful. For expression under control of the P_L N_RBS
promoter, E. coli strain K12 MC1000 lambda lysogen,
N7N53cI857 SusP80 (ATCC 39531), may be used. Also
useful is E. coli DG116 (ATCC 53606).

For M13 phage recombinants, E. coli strains
susceptible to phage infection, such as E. coli K12
strain DG98, are employed. The DG98 strain was
deposited with ATCC July 13, 1984 and has accession
number 39768.

The thermostable enzyme of this invention may be
used for any purpose in which such enzyme is necessary
or desirable. In a particularly preferred embodiment,
the enzyme herein is employed in the amplification
protocol set forth below.



Examples 

The following examples are intended to 
illustrate, but not limit, the present invention. 

1. Culturing of Pyrococcus furiosus and Preparation
of Pf Cell Paste

The following describes how the hyperthermophilic
archaebacterium, P. furiosus, is routinely grown in a
500 liter fermentor for the purpose of obtaining cell
mass in sufficient quantities for large scale protein
purification. It is a modified version [Bryant et
al., J. Biol. Chem., 264:5070-5079 (1989)] of the
original protocol of Fiala et al., Arch. Microbiol.,
145:56-61 (1986).

For culture maintenance, P. furiosus (DSM 3638)
is routinely grown at 85-88 °C as a closed static
culture in 100 ml of the medium described in Table 2.

Table 2

Maltose                      5 g/l
NH4Cl                        1.25 g/l
Elemental Sulfur             5 g/l
Na2S                         0.5 g/l
Synthetic Sea Water (1)
Vitamin mixture (2)          1 ml/l
FeCl3                        25 µM
Na2WO4                       10 µM
Yeast Extract                0.01%

1. Synthetic Sea Water:
   NaCl, 13.8 g/l
   MgSO4, 3.5 g/l
   MgCl2, 2.7 g/l
   KCl, 0.3 g/l
   CaCl2, 0.75 g/l
   KH2PO4, 0.5 g/l
   NaBr, 0.05 g/l
   KI, 0.05 g/l
   H3BO3, 0.015 g/l
   Sodium citrate, 0.005 g/l

2. Vitamin mixture [Balch et al., Microbiol. Rev.,
   43:260-296 (1979)]:
   Biotin, 2 mg/l
   Folic acid, 2 mg/l
   Pyridoxine hydrochloride, 10 mg/l
   Thiamine hydrochloride, 5 mg/l
   Riboflavin, 5 mg/l
   Nicotinic acid, 5 mg/l
   DL-Calcium pantothenate, 5 mg/l
   Vitamin B12, [illegible]
   p-Aminobenzoic acid, 5 mg/l
   Lipoic acid, 5 mg/l
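
Scaling the per-liter amounts of Table 2 to a given culture
volume is simple proportional arithmetic, sketched below for
the four components listed in grams per liter. The dictionary
and function names are hypothetical and are introduced here
only for illustration; the `omit` argument is an assumed
convenience for recipes that leave components out.

```python
# Components of Table 2 that are specified in g/l.
TABLE_2_G_PER_L = {
    "maltose": 5.0,
    "NH4Cl": 1.25,
    "elemental sulfur": 5.0,
    "Na2S": 0.5,
}


def weigh_out(volume_l, omit=()):
    """Return grams of each listed component for volume_l liters of medium."""
    return {
        name: g_per_l * volume_l
        for name, g_per_l in TABLE_2_G_PER_L.items()
        if name not in omit
    }
```

For example, `weigh_out(0.1)` gives the amounts for a 100 ml
maintenance culture, and `weigh_out(500, omit=("elemental
sulfur", "Na2S"))` gives 2500 g maltose and 625 g NH4Cl for a
500 liter batch prepared without sulfide or sulfur.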



Growth is monitored by the increase in turbidity
at 600 nm. Cells can be stored in the same medium at
4 °C and remain viable for at least a year, although
periodic transfer is recommended.

Large scale (preparative) growth of P. furiosus
was performed as follows:

Growth medium according to Table 2 was prepared,
except that the sulfide was replaced with titanium
(III) nitrilotriacetate [final concentration, 30 µM as
described in Moench et al., J. Microbiol. Meth.,
1:199-202 (1983)] and the elemental sulfur was omitted.
The medium was then sparged with Argon (Ar).

A two liter flask was inoculated with two 100 ml
cultures. The two liter culture was used as an
inoculum for a 20 liter culture. Two 20 liter
cultures were used to inoculate a 500 liter culture.
The culture was maintained at 88 °C, bubbled with Ar
(7.5 liters/min) and stirred at about 50 rpm. After
about 20 hours (A600 0.5) the cells were harvested
with a Sharples continuous flow centrifuge at 100
liters/hour. The cells were frozen in liquid N2
immediately after harvesting. The yield of cells is
typically 400-600 g wet weight.

It should be noted that P. furiosus has a
fermentative type of metabolism and produces organic
acids, CO2 and H2 as final products. H2 production
inhibits growth, so cultures have to be sparged with
Ar (or any inert gas) to remove H2. Alternatively,
elemental sulfur may be added. In this case, the
reductant that would otherwise be used to generate H2
is used to reduce elemental sulfur to H2S. The
addition of elemental sulfur is convenient for small
scale cultures in glass vessels, but its reduction
cannot be used to remove inhibitory H2 in 500 liter
stainless steel fermentors because of the corrosive
nature of H2S.

20 

2. Purification of Pf DNA Polvmerase I 
A. Lysis of Pf Cell Paste 

Fifty grams (g) of Pf cell paste prepared in 
Example 1 were thawed at room temperature. Two 

25 hundred milliliters (ml) of lysis buffer consisting of 
50 millimolar (mM) Tris-HCl, pH 8.2, 10 mM beta 
mercaptoethanol, 1 mM EDTA and 200 microgram/ml 
(/xg/ml) of lysozyme were admixed to the thawed cell 
paste. The admixture was thereafter maintained for 30 

30 minutes at room temperature. The maintained admixture 
was processed in a French press for two cycles. The 
cell lysate was sonicated for 10 minutes at room 
temperature and centrifuged at 16,000 RPM in a SA600 
rotor for 60 minutes at room temperature and the 

35 supernatant recovered. 
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B. Column Chroittatoaraphv of Pf Cell Lysate 

The supernatant prepared above was loaded onto
a Q-sepharose (2.5 X 40 centimeter) column at room
temperature. The column containing the cell lysate
supernatant was then washed with 200 ml of column
buffer (50 mM Tris-HCl, pH 8.2, 10 mM beta
mercaptoethanol and 1 mM EDTA). The column pass
through and the washes were collected, pooled, and
then centrifuged at 9000 X g in a Sorvall GS3 rotor at
room temperature to remove any insoluble material.

The resulting supernatant, containing partially
purified Pyro polymerase, was recovered from the
pellet and loaded directly onto a phosphocellulose
column (2.5 X 40 cm) at room temperature. The column
was washed with column buffer to remove any proteins
that did not bind to the column until the optical
density measured at 280 nm dropped to
baseline. The immobilized Pyro polymerase was
thereafter eluted with a one liter linear gradient of
NaCl ranging in concentration from 0 M to 0.7 M
dissolved in column buffer, and 10 ml fractions were
collected.
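
As a rough guide to where a given salt concentration falls in
the gradient described above, the following Python sketch maps
elution volume and fraction number to nominal NaCl
concentration. It assumes an ideal linear gradient with no
column dead volume or mixing lag, which is a simplification;
the function names are illustrative and do not come from the
original procedure.

```python
def gradient_concentration(volume_ml, total_ml=1000.0, start_m=0.0, end_m=0.7):
    """Nominal salt concentration (M) after volume_ml of an ideal linear gradient."""
    if not 0.0 <= volume_ml <= total_ml:
        raise ValueError("volume lies outside the gradient")
    return start_m + (end_m - start_m) * volume_ml / total_ml


def fraction_at(conc_m, fraction_ml=10.0, total_ml=1000.0, start_m=0.0, end_m=0.7):
    """1-based index of the fraction in which conc_m is nominally reached."""
    volume_ml = (conc_m - start_m) / (end_m - start_m) * total_ml
    return int(volume_ml // fraction_ml) + 1


if __name__ == "__main__":
    # Midpoint of the one liter, 0 to 0.7 M gradient.
    print(round(gradient_concentration(500.0), 3))
    # Fraction (10 ml each) in which 0.2 M NaCl is nominally reached.
    print(fraction_at(0.2))
```

In practice the measured conductivity of the fractions would
take precedence over such a nominal estimate.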

C. Assay for Pfu DNA Polymerase I Activity

The collected fractions were separately
assayed for Pyro polymerase activity. The following
reagents were admixed to form a reaction cocktail for
the measurement of Pyro polymerase activity; final
concentrations (fc) of the reagents in the cocktail
are in parentheses:

(1) 200 microliters (µl) active calf thymus
DNA, 575 µl distilled water, 20 µl 1 M Tris-HCl, pH
7.5 (fc=20 mM); 8 µl 1 M MgCl2 (fc=8 mM);

(2) 10 µl 0.75 M DTT (fc=7.5 mM); 4 µl 15
mg/ml BSA (fc=50 µg/ml);

(3) 30 µl 10 mM each of dATP, dCTP, dGTP and
dTTP (fc=0.15 mM for each); and

(4) 50 µl 3H-TTP in ethanol 432A (Amersham
Inc., Arlington Heights, IL) for a total volume of 1
ml.

To perform the Pyro polymerase (DNA Polymerase I)
activity assay, 25 µl of the reaction cocktail formed
above was admixed with 1 to 5 µl of each collected
fraction. The admixture was maintained for 10 to 60
minutes at 75°C, which was the optimal temperature for
enzymatic activity, to form a labelled DNA admixture.
After the maintenance period, 2 to 5 µl of the labelled
DNA admixture was pipetted onto DE81 (Whatman) filter
paper. The filter paper containing the labelled DNA
admixture was dried on aluminum foil under a heat lamp
for 5 minutes. After the drying period, each filter
was washed three times for 5 minutes with 50 ml of 2X
SSC (0.3 M NaCl, 0.03 M NaCitrate) followed by one
quick wash with 100% cold ethanol. The washed filters
were immediately placed on fresh aluminum foil and placed
under a heat lamp for 5 minutes to dry the filters.
The dried filters were separately placed in
scintillation vials containing 5 ml scintillation
fluid.

The 3H-DNA immobilized on the filter paper, which
reflects Pyro polymerase activity, was measured in a
scintillation counter. The results of this assay
indicated that the peak fractions from the
phosphocellulose column containing the highest
concentration of Pyro polymerase were eluted with 200
mM NaCl and that Pyro polymerase constituted about 10%
of the total protein present in those fractions.

D. FPLC Purification of Pyro Polymerase

The fractions containing approximately 90%
of the total DNA polymerase I activity as measured
above were pooled and dialyzed against column buffer
overnight at 4°C to form a NaCl-free Pyro polymerase
solution. The dialyzed salt-free Pyro polymerase
solution was loaded onto a Mono S HR 5/5 FPLC (fast
protein liquid chromatography) column (Pharmacia,
Piscataway, NJ) previously equilibrated with the
before-described column buffer. The Mono S column
containing the Pyro polymerase was washed with about
four column volumes of column buffer prior to elution
to remove any proteins that did not bind to the
column. The immobilized proteins were eluted with a
one liter linear gradient of NaCl ranging in
concentration from 0.0 M to 0.7 M dissolved in column
buffer.
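
The pooling rule used in this Example (keep the fractions that
together hold about 90% of the total polymerase activity) can
be sketched in Python as follows. The text does not say how the
fractions were chosen, so growing a contiguous window outward
from the peak fraction is an assumption made here for
illustration, and the scintillation counts are invented
placeholder values.

```python
def pool_fractions(activity, target=0.90):
    """Return (first, last) indices of the contiguous run of fractions,
    grown outward from the peak, holding at least `target` of total activity."""
    total = sum(activity)
    if total <= 0:
        raise ValueError("no activity detected")
    peak = max(range(len(activity)), key=activity.__getitem__)
    lo = hi = peak
    pooled = activity[peak]
    while pooled < target * total:
        left = activity[lo - 1] if lo > 0 else -1
        right = activity[hi + 1] if hi < len(activity) - 1 else -1
        # Extend toward whichever neighbouring fraction carries more activity.
        if left >= right:
            lo -= 1
            pooled += left
        else:
            hi += 1
            pooled += right
    return lo, hi


if __name__ == "__main__":
    # Hypothetical counts per fraction (cpm); not data from the Example.
    counts = [50, 120, 900, 4200, 9800, 6100, 1500, 300, 80]
    first, last = pool_fractions(counts)
    print(f"pool fractions {first} through {last} (0-based)")
```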

Fractions were collected and assayed for the
presence of FPLC purified Pyro polymerase activity as
described, respectively, in B and C above. The
results of this assay indicated that the peak
fractions from the Mono S column containing the
highest concentration of FPLC purified Pyro polymerase
were eluted with 120 mM NaCl. The fractions
containing 90% of the peak FPLC purified Pyro
polymerase activity were pooled and dialyzed against
the column buffer additionally containing 10% glycerol
overnight at room temperature to form NaCl-free FPLC
purified Pyro polymerase.

The resultant purified and dialyzed Pyro
polymerase was then subjected to a final purification
on a 1.5 X 20 cm Matrix gel Blue A column (Amicon,
Danvers, MA). The Matrix gel Blue A column was first
equilibrated with the before-described column buffer
containing 10% glycerol, 0.1% Tween 20
(polyoxyethylenesorbitan monolaurate) and 0.1%
Nonidet P40 (octylphenol-ethylene oxide condensate
containing an average of 9 moles ethylene oxide per
mole of phenol). The purified and dialyzed Pyro
polymerase was then applied to the column using FPLC
pumps. The column containing the Pyro polymerase
sample was then washed with two column volumes of the
glycerol-containing column buffer to remove any
proteins that did not bind to the column. The
immobilized Pyro polymerase was eluted from the column
with a one liter linear gradient of KCl ranging in
concentration from 0.0 M to 0.7 M KCl.

Eluted fractions from the Affi-gel column were
collected and assayed for the presence of purified
Pyro polymerase activity as described in Examples 2B
and C above. The fractions eluted with 200 to 300 mM
KCl contained the peak Pyro polymerase activity, with
the optimal activity recovered at about 280 mM KCl.
The peak fractions were pooled and concentrated
through Centricon-30 columns, which have a molecular
weight cut-off at 30,000 daltons (Amicon, Beverly, MA), to
form a concentrated solution of purified Pyro
polymerase. The purified Pyro polymerase was
thereafter dialyzed against column buffer containing
50% glycerol to form KCl-free purified Pyro
polymerase. The resultant salt-free Pyro polymerase
was determined to be about 95% homogeneous.

3. Molecular Weight Determination

The molecular weight of the purified Pyro
polymerase prepared in Example 2D was determined by
SDS-PAGE under non-denaturing conditions according to
the method of Laemmli et al., J. Mol. Biol., 80:575-599
(1973). Samples of Pyro polymerase, Taq
polymerase, phosphorylase B, and bovine serum albumin
were applied to a 6-18% gradient, 1 mm thick,
SDS-polyacrylamide gel (Novex, Encinitas, CA) and
electrophoresed in a running buffer containing 1%
SDS, 2.4 mM Tris, and 18 mM Glycine. The results of
that analysis, shown in Figure 1, indicate that Pyro
polymerase migrates faster than phosphorylase B
(Sigma, St. Louis, MO; molecular weight 97,200 daltons)
and Taq polymerase (Perkin-Elmer Cetus, Norwalk, CT;
94,000 daltons), but slower than BSA (Sigma, 66,000
daltons). Because of its proximity to Taq, Pyro
polymerase was assigned a relative molecular weight of
90,000-93,000 daltons.

4. Fidelity Assays

Various assays were performed to complete the
characterization of DNA Polymerase I purified from P.
furiosus. In the assays described below, the Pyro
polymerase was compared to the commercially available
and well characterized Thermus aquaticus (Taq) DNA
polymerase (U.S. Patent No. 4,889,818).

To determine the error rate of Pyro polymerase,
fidelity assays by PCR were performed with the Pyro
polymerase in an assay procedure generally described
by Kohler et al., Proc. Natl. Acad. Sci. USA,
88:7958-7962 (1991), in which in vivo mutagenesis was
monitored during PCR amplification of transgenic mouse
genomic DNA containing 33 copies per cell of the
lacIOZa transgene. The entire lac I gene plus the
alpha-complementing fragment of the beta-galactosidase
gene was amplified (30 to 40 rounds) by PCR to form
amplified DNA. The amplified 1.9 kb DNA was then
cloned into the EcoR I site of lambda gt10 and plated
on host strain DH5alpha (lacZΔM15) which contains the
alpha fragment of beta-galactosidase.

The complementation of the two proteins resulted
in enzymically-active beta-galactosidase, which was
detected as blue plaques when X-gal
(5-bromo-4-chloro-3-indolyl-beta-D-galactoside) was
present as a substrate. In contrast, a functional
non-mutant lac I repressed the expression of
beta-galactosidase, which
resulted in white plaques. The mutation frequency in
lac I was defined as the proportion of mutant blue
plaques to the total number of plaques scored. The
error rate was then calculated by the formula E =
2(mf/d) where mf was the observed mutation frequency
in the PCR product (lac I) and d was the number of
doublings according to Saiki et al., Science, 239:487-
491 (1988).
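
The two quantities defined above can be computed as in the
following Python sketch. It implements the mutation frequency
and the formula E = 2(mf/d) exactly as they are printed in this
text; other published formulations also normalize by the
length of the mutational target, which is not done here. The
plaque counts and the number of doublings in the usage lines
are invented placeholders, not results from this Example.

```python
def mutation_frequency(blue_plaques, total_plaques):
    """Proportion of mutant (blue) plaques among all plaques scored."""
    if total_plaques <= 0:
        raise ValueError("no plaques scored")
    return blue_plaques / total_plaques


def error_rate(mf, doublings):
    """E = 2(mf/d), following the formula as printed in the text."""
    if doublings <= 0:
        raise ValueError("number of doublings must be positive")
    return 2.0 * mf / doublings


if __name__ == "__main__":
    # Hypothetical counts: 12 blue plaques out of 6000 scored, 20 doublings.
    mf = mutation_frequency(12, 6000)
    print(mf, error_rate(mf, 20))
```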

Results from these studies indicate that Pyro
polymerase has an error rate (mutations per nucleotide
per PCR cycle) as low as 1 X 10'^ compared to 1 X 10'^ 
for Taq DNA polymerase as described by Eckert et al., 
Nucl. Acids Res., 18:3739-3744 (1990). Thus, Pyro
polymerase exhibits about a ten-fold greater
replication fidelity than Taq DNA polymerase.

5. Exonuclease 3' to 5' Activity Assays

Purified Pyro polymerase, prepared as in Example
2D, was assayed to measure its 3' to 5' exonuclease
activity. The exonuclease assay was conducted
according to Chase et al., J. Biol. Chem., 249:4545-4552
(1974), except that the reaction was at 72°C instead
of 37°C and the substrate was Taq I-digested lambda
DNA filled in with tritium labelled dCTP and dGTP.
Briefly, the following reagents were admixed to form a
reaction cocktail for the measurement of 3' to 5'
exonuclease activity; final concentrations (fc) of the
reagents in the cocktail are in parentheses:

(1) 40 µl 1 M Tris-HCl, pH 7.5 (fc=40 mM);

(2) 10 µl 1 M MgCl2 (fc=10 mM);

(3) 13.3 µl 0.75 M DTT (fc=10 mM);

(4) 100 µl labelled Taq I cut lambda DNA
filled in with Sequenase™ and labelled with 3H-dCTP
and 3H-dGTP obtained from Amersham, Arlington Heights,
IL; and

(5) 636.7 µl distilled water for a total
volume of 1 ml.

For the 3' to 5' exonuclease activity assay, 20
to 25 µl of the prepared reaction cocktail was admixed
with either Pyro or Taq DNA polymerase. The admixture
was maintained for 10 to 60 minutes at 72°C to form
hydrolyzed lambda DNA, specifically phosphate
mononucleotides. The reaction in each admixture was
terminated by admixing 5 µl of 15 mg/ml bovine serum
albumin (BSA) and 13 µl of 50% trichloroacetic acid
(TCA) and maintaining the admixture on ice for 15
minutes. The terminated reaction admixture was then
centrifuged at 12,000 X g for 15 minutes to pellet the
unhydrolyzed intact lambda DNA.

The resultant supernatant containing the
5' exonuclease-derived phosphate mononucleotides was
removed from the pellet. Forty µl of each supernatant
was admixed with 80 µl distilled water and 1 ml
scintillation fluid. The amount of radioactivity
detected by scintillation counting was a relative
measure of the exonuclease activity of the Pyro and
Taq DNA polymerase preparations.

The results of this study show that Pyro
polymerase exhibits detectable 3' to 5' exonuclease
activity, whereas Taq polymerase does not.

To determine the amount of non-specific nuclease
activity present in the Pyro polymerase preparation,
Pyro polymerase prepared in Example 2D was admixed
with 20 to 25 µl of a reaction cocktail. The
following reagents were admixed to form the reaction
cocktail for the measurement of non-specific nuclease
activity; final concentrations (fc) of the reagents in
the cocktail are in parentheses:

(1) 40 µl 1 M Tris-HCl, pH 7.5 (fc=40 mM);

(2) 10 µl 1 M MgCl2 (fc=10 mM);

(3) 13.3 µl 0.75 M DTT (fc=10 mM);

(4) 100 µl 3H-labelled E. coli chromosomal
DNA (sheared ten times through a 21 gauge needle); and

(5) 835.7 µl distilled water for a final
volume of 1 ml.

The admixture containing the reaction cocktail
and the homogenized gel were maintained for 10 to 60
minutes at 75°C, which was the optimal temperature for
the enzyme, to form hydrolyzed nucleic acid product.
The reaction in the admixture was terminated and
supernatant was assayed as described in Example 2C
above. The results of this assay show that Pyro
polymerase did not exhibit detectable non-specific
nuclease activity. The 3' to 5' exonuclease activity
by Pyro polymerase is, therefore, specific and not due
to non-specific nuclease activity.

Thus, Pfu DNA polymerase I is a thermostable DNA
polymerase that, unlike Taq DNA polymerase, possesses
a 3' to 5' exonuclease activity which enables Pfu
polymerase to proofread errors, and thereby exhibits
a ten-fold greater fidelity during DNA synthesis
reactions.

6. PCR with Pyro Polymerase

The specificity of Pyro polymerase was evaluated
in PCR amplification of template DNA compared to that
achieved with Taq DNA polymerase. To prepare the
template DNA for the reactions, genomic DNA was first
purified by phenol extraction and alcohol
precipitation according to the procedures of Ausubel
et al., Current Protocols in Molecular Biology, John
Wiley and Sons (1987), from blood obtained by tail
bleed from a transgenic mouse having about 1 to 2
copies of a lambda transgene vector, which is referred
to as lambda transgene mouse genomic DNA.

The Iz-lambda polynucleotide primers
used in the PCR amplifications were prepared by
chemical synthesis using a model 381A polynucleotide
synthesizer (Applied Biosystems Inc., Foster City, CA)
according to the manufacturer's instructions.

A hybridization reaction admixture was formed by
combining the following reagents in a sterile 0.5 ml
microfuge tube:

(1) 80 µl of sterile, autoclaved water;

(2) 10 µl of 10X reaction buffer containing
500 mM KCl, 100 mM Tris-HCl, pH 8.3, 15 mM MgCl2, and
0.1% sterile gelatin;

(3) 8 µl of a solution containing 2.5 mM
each of the deoxynucleotidetriphosphates (dNTP's)
dGTP, dCTP, dTTP, and dATP;

(4) 1 µl of a solution containing 250 ng
each of the two polynucleotide primers described
above; and

(5) 1 µl of lambda transgene mouse genomic
DNA.
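
The final concentrations in this admixture follow from simple
dilution (C1 x V1 = C2 x V2). The Python sketch below works
them out from the volumes and stock concentrations listed
above. It treats the five listed components as the whole
reaction volume and ignores the 0.5 µl of enzyme added later,
which is an assumption; the variable names are illustrative
only.

```python
# Volumes (µl) of the five components of the hybridization admixture.
volumes_ul = {
    "water": 80.0,
    "10X reaction buffer": 10.0,
    "dNTP solution": 8.0,
    "primer solution": 1.0,
    "genomic DNA": 1.0,
}
total_ul = sum(volumes_ul.values())


def dilute(stock, volume_ul, total_volume_ul):
    """Final concentration after dilution; same unit as `stock`."""
    return stock * volume_ul / total_volume_ul


# Composition of the 10X reaction buffer as listed in the text.
buffer_10x = {
    "KCl (mM)": 500.0,
    "Tris-HCl (mM)": 100.0,
    "MgCl2 (mM)": 15.0,
    "gelatin (%)": 0.1,
}

final = {
    name: dilute(conc, volumes_ul["10X reaction buffer"], total_ul)
    for name, conc in buffer_10x.items()
}
final["each dNTP (mM)"] = dilute(2.5, volumes_ul["dNTP solution"], total_ul)

if __name__ == "__main__":
    print(f"total volume: {total_ul:.0f} µl")
    for name, conc in final.items():
        print(f"{name}: {conc:g}")
```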

The hybridization reaction admixture was heated
to 94°C and maintained at 94°C for 1 minute to
denature the duplex genomic DNA present and form
single-stranded templates. The admixture was then
cooled to 54°C and maintained for 2 minutes to allow
hybridization to occur and form duplex DNA. The
hybridized admixture was thereafter centrifuged in a
microfuge at 12,000 X g for 10 seconds to collect
condensation off the microfuge tube walls. To
separate microfuge tubes, 0.5 µl of a solution
containing 2.5 units of either Taq DNA polymerase
(obtained from Perkin-Elmer Cetus, Norwalk, CT or from
Stratagene, La Jolla, CA) or Pyro polymerase prepared
in Example 2D was admixed to form a primer extension
reaction admixture. Additional separate primer
extension reaction admixtures were made by diluting
the Pyro polymerase 1:1.5, 1:2, 1:2.5,
1:3, 1:4 and 1:5 to determine the optimal
concentration for DNA synthesis. (The 1:3 dilution
represented about 1.5 units of Pyro polymerase per
microliter.)

Each microfuge tube containing the above-prepared
primer extension reaction admixture was overlaid with
50 µl of mineral oil and then placed into a DNA Thermal
Cycler (Perkin-Elmer Cetus) and subjected to the
following temperature and time conditions: 1) 94°C
for 1 minute to denature duplex DNA; 2) cooled to 54°C
for 2 minutes to anneal the primers; and 3) heated to
74°C for 1.0 minute to activate the polymerase and
maintained at 74°C for 0.5 minutes to extend the
annealed primers. The tubes were subjected to 30
cycles of the above sequence according to the
manufacturer's instructions. The cycled tubes were
then maintained at 72°C for 10 minutes followed by 4°C
for 12 hours.
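
The cycling program described above can be written down as a
small data structure, which makes the total programmed hold
time easy to check. The Python sketch below is illustrative
only: the step labels and class name are not from the original
procedure, ramp times between temperatures are ignored, and
the 12 hour hold at 4°C is treated as storage and left out of
the total.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float
    minutes: float


# One cycle, using the temperatures and times given in the text.
CYCLE = (
    Step("denature", 94.0, 1.0),
    Step("anneal", 54.0, 2.0),
    Step("activate polymerase", 74.0, 1.0),
    Step("extend", 74.0, 0.5),
)
FINAL_EXTENSION = Step("final extension", 72.0, 10.0)
N_CYCLES = 30


def hold_minutes(cycle=CYCLE, n_cycles=N_CYCLES, final=FINAL_EXTENSION):
    """Total programmed hold time in minutes, excluding ramps and the 4°C hold."""
    return n_cycles * sum(step.minutes for step in cycle) + final.minutes


if __name__ == "__main__":
    print(hold_minutes())
```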

The contents of each primer extension reaction
admixture were analyzed on a 6% polyacrylamide gel in
1X TBE by loading 35 µl of the admixture sample and 5
µl of 10X sample buffer onto an 8 centimeter gel,
electrophoresing the gel at 100 V for about one hour,
followed by staining the electrophoresed gel with
ethidium bromide to visualize the electrophoresed
nucleic acids.

The results of the electrophoresed PCR amplified
lambda transgene mouse genomic DNA using either Pyro
or Taq DNA polymerase indicated that the Taq from Cetus
and Stratagene gave nearly identical results. The
results using the Pyro DNA polymerase to amplify
genomic DNA indicate that it produces less background.
The 1:2.5 dilution of Pyro polymerase resulted in
optimal PCR amplification.

7. Large Scale Preparation of Pyrococcus furiosus
DNA Polymerase I

The following steps 1-11 are performed at room 
temperature: 

1. Thaw 1500 grams of frozen cell paste at room
temperature. Add 4 volumes (6000 ml) of lysis buffer
A.

2. Resuspend the cells and incubate at room
temperature for 30 minutes. Cycle the cells through a
French press two times.

3. Sonicate the product of step 2 for 10
minutes at room temperature. Centrifuge the resulting
lysate at 9K RPM in a GS3 (Sorvall) rotor at room
temperature for 60 minutes.

4. Collect the supernatant (Fraction I) and
load it directly onto a Q-sepharose (8 X 30 cm)
column. Wash the loaded column with 4 volumes (6
litres) of buffer B.

5. Collect the pass-through and washes. Adjust
the collected material to pH 6.0 with HCl, and
centrifuge it at 9K RPM in a GS3 rotor at room temperature
for 60 minutes to remove the protease-containing
precipitate.

6. Collect the supernatant and adjust its pH to
7.8 with NaOH (Fraction II). Load it at 50 ml/minute
directly onto a 500 ml radial flow P-11
(phosphocellulose) (Sepragen, San Leandro, CA) column
equilibrated with buffer B. Wash the column with 1-2
litres of buffer B until the OD280 of the wash is back
to baseline.

7. Elute DNA polymerase activity with a 0-0.7 M
NaCl gradient in buffer B (2 X 2500 ml). The peak of
polymerase activity elutes at approximately 0.2-0.4 M
NaCl.

8. Assay and pool those fractions containing
greater than 90% of the total polymerase activity.

All subsequent steps are performed at 4°C.

9. Dialyze the pooled material of step 8
overnight against 200 volumes buffer B at 4°C.

10. Remove the dialyzed Pyro polymerase and
clarify it with a 30 minute centrifugation if
necessary (Fraction III). Load the dialysate onto a
heparin-agarose column (5 X 20 cm) equilibrated with
buffer B. Wash with 1 litre buffer B and elute with a
0-0.7 M NaCl gradient in buffer B (2 X 1.5 litres).

11. Assay for polymerase activity, and pool the
fractions containing greater than 90% of total
activity. Dialyze the recovered material against 200
volumes of buffer C.

12. Remove the Pyro polymerase from dialysis
(Fraction IV) and load it onto an Affigel Blue column
(2.5 X 20 cm) equilibrated with buffer C.

13. Wash the Affigel-Blue column with buffer C
until the OD280 approaches baseline and elute the
Pyro polymerase with a 0-0.7 M KCl gradient in buffer C
(2 X 1000 ml).

14. Assay for polymerase and 3' exonuclease
activities. Analyze the active fractions by silver
stained SDS-PAGE gels. Pool the pure Pyro polymerase
(greater than 90% pure on a w/w basis) fractions based
on visual analysis of the silver stained gel.

15. Dialyze overnight against 200 volumes of
buffer D at 4°C. Remove the dialysate (Fraction V)
and load it onto a 1.5 x 30 cm P11 (phosphocellulose)
column equilibrated with buffer D. Wash with buffer D
until wash OD280 is baseline. Elute the Pfu DNA Pol I
with a 2 X 250 ml linear gradient 0.0-0.7 M KCl
prepared in buffer D.

16. Assay for polymerase and 3' exonuclease
activities. Analyze the active fractions by silver
stained SDS-PAGE gels. Pool the pure Pyro polymerase
(greater than 95% pure on a w/w basis) fractions based
on visual analysis of the silver stained gel.

17. Pool and dialyze overnight at 4°C against 20
volumes buffer E. Remove from dialysis (Fraction VI)
and store at -20°C.

BUFFERS

Buffer A: (6 litres)
50 mM Tris-Cl pH 8.2
10 mM beta mercaptoethanol
1 mM EDTA
200 µg/ml lysozyme

Buffer B: (20 litres)
50 mM Tris-Cl pH 8.2
10 mM beta mercaptoethanol
1 mM EDTA

Buffer C: (6 litres)
50 mM Tris-Cl pH 8.2
10% glycerol
1 mM EDTA
1 mM DTT
0.1% Tween 20
0.1% Nonidet P40

Buffer D: (2 litres)
50 mM Tris-Cl pH 8.2
10% glycerol
1.0 mM EDTA
10 mM beta mercaptoethanol
0.1% Tween 20
0.1% Nonidet P40

Buffer E
50 mM Tris-HCl pH 8.2
0.1 mM EDTA
1 mM DTT
0.1% Tween 20
0.1% NP 40
50% glycerol

RESINS 

1.5 LITRES Q-SEPHAROSE (PHARMACIA) 
700 MLS P-11 (WHATMAN)
200 MLS HEPARIN-AGAROSE (BIORAD) 
200 ML AFFIGEL-BLUE (BIORAD) 

8. Production of Pfu I

Cell Growth: Pyrococcus furiosus (DSM 3638) was
grown at 85-88°C as closed static cultures in a medium
containing maltose (5 g/liter), NH4Cl (1.25 g/liter),
elemental sulfur (S°, 5 g/liter), Na2S (0.5 g/liter),
synthetic sea water (17), a vitamin mixture (18),
FeCl3 (25 mM), Na2WO4 (10 mM) and yeast extract (1.0
g/liter). Growth was monitored by direct cell count
and by the increase in turbidity at 600 nm. For large
scale cultures, sulfide was replaced with titanium
(III) citrate, and S° was omitted, which necessitated
sparging with Ar. Two 20 liter cultures served as an
inoculum for growth in a 400 liter fermenter where
the cultures were maintained at 88°C, bubbled with Ar
(7.5 liters/min) and stirred (50 rpm). Cells were
harvested after approximately 20 hours (OD600 ~0.5) with
a Sharples centrifuge at 100 liters/hour.

NOTE: The following procedures are performed at 25°C,
unless otherwise stated.

Cell Lysis: The night before lysis, 500 grams of
frozen cell paste is transferred to a 2-8°C
refrigerator. The next morning, the cells are
transferred to a 4 liter stainless steel beaker. The
cells are resuspended using 4 volumes (2000 ml) of
lysis Buffer 8A. The cell suspension is then
incubated at room temperature for 1.5 hours. The
cells are lysed in the French Press using 2 passes at
8K PSI. The lysate is then sonicated at room
temperature for 10 minutes.

Following sonication, the lysate is transferred
to 400 ml bottles and spun for 60 minutes at 9K rpm at
room temperature in the Sorvall RC-2B using a Sorvall
GS3 rotor. The supernatant (Fraction I) is collected
and the volume measured.

Q-Sepharose Column: Fraction I is loaded
directly onto an 8 x 30 cm radial flow Q-sepharose
column (~1500 ml) pre-equilibrated in Buffer 8B at a
flow rate of approximately 50 ml/minute. The column
is then washed with 4 column volumes (6000 ml) of
Buffer 8B.

NOTE: It is important to carefully collect the
pass-thru and wash fractions as they contain the Pfu
polymerase enzyme.

The Q-Sepharose pass-thru and column washes are
next combined as one pool (Fraction II).
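
The approximate bed volumes quoted for the columns in this
Example are consistent with treating each pair of dimensions
as diameter x height of a simple cylinder. The Python sketch
below makes that check for the 8 x 30 cm Q-sepharose column.
The cylindrical reading is an assumption (the text describes
this column as radial flow), and the function name is
illustrative only.

```python
import math


def bed_volume_ml(diameter_cm, height_cm):
    """Volume of a cylindrical bed in ml (1 cubic centimeter = 1 ml)."""
    return math.pi * (diameter_cm / 2.0) ** 2 * height_cm


if __name__ == "__main__":
    q_sepharose = bed_volume_ml(8.0, 30.0)
    print(f"bed volume: {q_sepharose:.0f} ml")
    # Four column volumes, for comparison with the 6000 ml wash in the text.
    print(f"4 column volumes: {4 * q_sepharose:.0f} ml")
```

The same function applied to the other column dimensions in
this Example gives values close to the volumes quoted for them.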

BPA-1000 Precipitation: Bioprocessing aid
BPA-1000 (TosoHaas) pilots are conducted on Fraction
II to determine the appropriate volume required to
precipitate cell debris and nucleic acids but not Pfu
polymerase. 1.0 ml aliquots of Fraction II are placed
in 8 tubes. The BPA-1000 is mixed thoroughly and 0,
2, 4, 6, 8, 10, 12, 14, 16, 18 and 20 µl of BPA-1000
are added to the tubes respectively, mixed gently,
incubated for 5 minutes, and spun in a microfuge for 5
minutes. Carefully decant and save the supernatants.
Samples of each tube are then evaluated using the
gapped duplex polymerase assay. Evaluate the clarity
of each supernatant and the firmness of each pellet.
Based on the control tube (without BPA addition),
determine which concentration of BPA increases the
clarity of the supernatant without sacrificing
polymerase yield. The appropriate amount of BPA-1000
is then added to Fraction II and stirred for 30
minutes. The suspension is then transferred to 400 ml
centrifuge bottles and spun at 9K rpm at room
temperature (25°C) for 60 minutes in a Sorvall RC-2B
using a Sorvall GS3 rotor. The supernatant (Fraction
III) is collected and the volume measured.

Fraction III is then adjusted to pH 7.5 with 1 N
HCl if necessary.

P-11 Cellulose Column: Fraction III is loaded
directly onto a 10 x 14 cm P-11 column (~1000 ml)
pre-equilibrated in Buffer 8C at a flow rate of
approximately 5 ml/minute. The column is washed with
Buffer 8C until the OD280 approaches baseline (4 column
volumes). The column is next eluted with a 2 x 4000
ml gradient (0 to 700 mM KCl) prepared in Buffer 8C.
25 ml fractions are collected; every fifth tube, the
put-on and the pass-thru are assayed for polymerase
activity. The fractions containing the peak of Pfu
Pol I activity are pooled and concentrated to
approximately 200 ml using an Amicon model CH2
concentrator with a S1Y30 membrane cartridge. Any
remaining Pfu Pol I is washed from the concentrator by
flushing the lines with an additional 200 ml Buffer
8C. The concentrate and wash are combined (Fraction
IV).

NOTE: The following procedures are performed at

Fraction IV is next dialyzed overnight against 18
liters of Buffer 8D.

Hi-Load Column: The following morning the
dialysate is transferred to 400 ml centrifuge bottles
and spun at 9K rpm at 4°C for 60 minutes in a Sorvall
RC-2B using the GS3 rotor. The supernatant (Fraction
V) is recovered and the volume recorded. Fraction V
is divided into two equal portions. The first portion
is loaded directly onto a FPLC Hi-Load S column (~58
ml) pre-equilibrated in Buffer 8D at a flow rate of 5.0
ml/minute. The column is washed with Buffer 8D until
the OD280 approaches baseline. The column is next
eluted with a 2 x 250 ml gradient (0 to 250 mM KCl)
prepared in Buffer 8D. 5.0 ml fractions are collected;
every third tube, the put-on and the pass-thru are assayed for
polymerase activity. The above procedure is repeated
for the second portion of Fraction V. The fractions
containing the peak Pfu Pol I activity are pooled and
dialyzed overnight against 2 x 4 liters of Buffer 8E
at 4°C. The following morning, the dialysate is
removed from dialysis and the volume recorded
(Fraction VI).

Heparin Sepharose CL-6B Column: Fraction VI is
loaded onto a 1.5 x 90 cm heparin sepharose CL-6B
column (~159 ml) pre-equilibrated in Buffer 8E at a
flow rate of 0.5 ml/minute. The column is washed with
1000 ml Buffer 8E. The column is next eluted with a 2
X 750 ml gradient (0 to 300 mM KCl) prepared in Buffer
8E. 10 ml fractions are collected; every third tube, the
put-on and the pass-thru are assayed for polymerase
activity. A protein gel is also recommended for peak
evaluation. The fractions containing the peak Pfu DNA
polymerase activity are pooled and dialyzed overnight
against 2 x 4 liters of Buffer The following
morning, the dialysate is removed from dialysis and
the volume recorded (Fraction VII).

Affi-Gel Blue Column: Fraction VII is loaded
onto a 2.5 X 4.0 cm affi-gel blue column (~20 ml)
pre-equilibrated in Buffer 8E at a flow rate of 0.5
ml/minute. The column is washed with Buffer 8E until
the OD280 approaches baseline. The column is next
eluted with a 2 x 500 ml linear gradient (0 to 250 mM
KCl) prepared in Buffer 8E. 10 ml fractions are
collected; every third tube, the put-on and the pass-thru are
assayed for polymerase activity. An SDS-PAGE gel is
also recommended for careful peak evaluation. The
fractions containing the peak Pfu Pol I activity are
pooled and dialyzed overnight against 1 liter final
dialysis Buffer 8F. The following morning, the
purified enzyme is removed from dialysis and
transferred to -20°C storage and represents a purified
Pfu polymerase I enzyme of this invention.

Buffer 8A: Lysis Buffer
50 mM Tris-Cl, pH 8.2
1 mM EDTA
10 mM b-mercaptoethanol
200 µg/ml lysozyme

Buffer 8B: Q-Sepharose Buffer
50 mM Tris-Cl, pH 8.2
1 mM EDTA
10 mM b-mercaptoethanol

Buffer 8C: Phosphocellulose Buffer
50 mM Tris-Cl, pH 7.5
1 mM EDTA
10 mM b-mercaptoethanol

Buffer 8D: High Load S Buffer
50 mM Tris-Cl, pH 7.5
1 mM EDTA
1 mM DTT
10 % (v/v) glycerol
0.1 % (v/v) NP-40
0.1 % (v/v) Tween 20

Buffer 8E: Affigel Blue & Heparin Buffer
50 mM Tris-Cl, pH 8.2
1 mM EDTA
1 mM DTT
10 % (v/v) glycerol
0.1 % (v/v) NP-40
0.1 % (v/v) Tween 20

Buffer 8F: Final Dialysis Buffer
50 mM Tris-Cl, pH 8.2
0.1 mM EDTA
1 mM DTT
0.1 % (v/v) NP-40
0.1 % (v/v) Tween 20
50 % (v/v) glycerol



9. Cloning the Gene that Encodes Pyrococcus furiosus
(Pfu) DNA Polymerase I

Pyrococcus furiosus (DSM 3638) was grown as
described by Bryant et al., J. Biol. Chem.,
264:5070-5079 (1989), with an additional supplement of
10 mM Na2WO4 as described in the Examples. Following
harvesting by centrifugation, genomic DNA was isolated
from the biomass using Stratagene's genomic DNA
isolation kit according to the manufacturer's instructions.
The DNA was then randomly sheared by several passages
through an eighteen gauge needle and the fragments
were separated by sucrose gradient centrifugation.
The size of the fragments present within the fractions
of the sucrose gradient was next estimated by agarose
gel electrophoresis. The fractions containing four to
nine kilobase fragments were combined and ligated to
EcoRI linkers and the resulting inserts were ligated
into EcoRI cut Lambda Zap II vector (Stratagene, La
Jolla, CA) to create a genomic Pyrococcus furiosus
library. This library was plated with XL1-Blue E.
coli (Stratagene) on LB plates. Plaque lifts were
performed on Duralose nylon filters (Stratagene) to
isolate individual bacteriophage colonies containing a
cloned insert.

N-terminal amino acid sequence determination of
Pfu I purified as described in Example 2 was performed
by the Wistar Institute. Briefly, partially purified
protein was subjected to SDS-PAGE followed by
electrotransfer to a nylon membrane. The band
corresponding to Pfu polymerase was isolated and
subjected to protein microsequencing. From this
microsequencing analysis, the unambiguous sequence of
the forty-eight N-terminal amino acids of the Pfu
polymerase I protein was determined. The 48 residues
correspond to residues 1 to 48 of SEQ ID NO 1.
Internal amino acid sequence analysis was also
performed by the Wistar Institute on a tryptic digested
fragment of the Pfu polymerase protein.

Based on the N-terminal 48 amino acid residue
sequence information, a series of degenerate PCR
oligonucleotide primers were designed in pairs which
would produce a 94 basepair (bp) PCR product. The 94
bp product corresponds to amino acid residues within
the 48 residues. These oligonucleotides (23 and 18
bases corresponding to the two ends of the 94-mer)
were designed such that every possible sequence of the
3' terminal 8 nucleotides, based on the known amino
acid residue sequence within the 48 residues, was
present as a separate oligonucleotide. The 23-mer
corresponds to possible nucleotides at positions 224
to 246, and the 18-mer corresponds to possible
nucleotides beginning at position 300. Whenever there
was a wobble position in the rest of the
oligonucleotide primer, a T was used. The rationale
for these substitutions was that the
GC content of Pfu DNA is low and that T
mismatches are the best tolerated by Taq polymerase. In
this way, four oligos were needed for each of the two
primer positions. Each of these oligos contained some
mismatched bases but no degenerate positions.
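
The primer design rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure actually used: the function names are hypothetical, the codon table is the standard genetic code restricted to the residues needed, and "a T was used" is read here as placing T at every position of the 5' portion where the synonymous codons disagree. The example takes residues 1 to 8 of SEQ ID NO 1 (Met Ile Leu Asp Val Asp Tyr Ile), which span the 23-mer.

```python
from itertools import product

# Standard genetic code (DNA codons), restricted to the residues used below.
CODONS = {
    "M": ["ATG"],
    "I": ["ATT", "ATC", "ATA"],
    "L": ["TTA", "TTG", "CTT", "CTC", "CTA", "CTG"],
    "D": ["GAT", "GAC"],
    "V": ["GTT", "GTC", "GTA", "GTG"],
    "Y": ["TAT", "TAC"],
}


def column_bases(peptide):
    """For each nucleotide position, collect the bases allowed by the peptide."""
    columns = []
    for residue in peptide:
        for i in range(3):
            columns.append({codon[i] for codon in CODONS[residue]})
    return columns


def design_primers(peptide, length, tail=8):
    """Hypothetical helper: T at ambiguous 5' positions, all options in the 3' tail.

    The column-wise enumeration of the tail is exact only when the tail
    residues vary at independent positions, as Asp and Tyr do here.
    """
    columns = column_bases(peptide)[:length]
    head, end = columns[:-tail], columns[-tail:]
    # 5' portion: keep an unambiguous base, otherwise substitute T.
    fixed = "".join(next(iter(c)) if len(c) == 1 else "T" for c in head)
    # 3' portion: one oligonucleotide per possible 8-nucleotide sequence.
    tails = ["".join(t) for t in product(*(sorted(c) for c in end))]
    return [fixed + t for t in tails]


forward_primers = design_primers("MILDVDYI", length=23)
for primer in forward_primers:
    print(primer)
```

Under these assumptions the 3' tail covers Asp, Tyr and the first two bases of Ile, so two two-fold choices give four oligonucleotides, in line with the four oligos per primer position stated above. Four oligos at each of the two positions combine into 4 x 4 = 16 primer pairs.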

The PCR was performed on Pfu genomic DNA with all
16 possible primer pairs. Following agarose gel
electrophoresis, it was determined that 12 of the 16
amplification reactions produced the expected 94 bp PCR
product. One of the reactions containing the 94 bp
product was subjected to direct cycle sequencing
(Stratagene) using both PCR primers used in the
amplification reaction as sequencing primers. A 53
base sequence deduced from the DNA sequencing gel was
found to correspond 100% to the known N-terminal
amino acid sequence between a pair of primers, thus
confirming that the sequence was from the gene
encoding Pfu DNA polymerase. A 53 base
oligonucleotide probe containing this sequence, and
having the nucleotide sequence shown in SEQ ID NO 2
from nucleotides 247 to 299, was next synthesized.
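
As a rough check of the coordinates quoted above, the probe region can be read directly from the rows of SEQ ID NO 2 that cover nucleotides 181 to 300. The sketch below is illustrative only: the helper name `fetch` is hypothetical, the two rows are copied from the sequence listing, and the listing strand is used as printed (the text does not say which strand the probe matched).

```python
# Rows of SEQ ID NO 2 covering nucleotides 181 to 300, as printed in the listing.
ROW_START = 181
ROWS = (
    "GTTTTATACT CCAAACTGAG TTAGTAGATA TGTGGGGAGC ATAATGATTT TAGATGTGGA "
    "TTACATAACT GAAGAAGGAA AACCTGTTAT TAGGCTATTC AAAAAAGAGA ACGGAAAATT"
)
REGION = ROWS.replace(" ", "")


def fetch(start, end):
    """Return nucleotides start..end (1-based, inclusive) from the stored rows."""
    return REGION[start - ROW_START : end - ROW_START + 1]


forward_site = fetch(224, 246)  # 23-mer primer position
probe = fetch(247, 299)         # 53-base probe position

print(forward_site, len(forward_site))
print(probe, len(probe))
# 23-base primer site + 53-base probe + 18-base primer site
print(len(forward_site) + len(probe) + 18)
```

The three lengths add up to the 94 bp product size given in the text, with the probe lying entirely between the two primer positions.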

This oligonucleotide was then end labelled with 32P
and used to screen the Pfu genomic library.
Approximately four Pyrococcus genomes were represented
in the screened library. The probe was hybridized to
plaque lifts at 42°C for two hours in QuikHyb
(Stratagene) and washed twice at room temperature for
15 minutes each in 2X SSC, 0.1% SDS. Thirteen putative
clones were identified and the plaques cored and
resuspended in 300 µl SM. Ten µl of each lysate was
transferred into 1X Taq reaction buffer, heated to
100°C for ten minutes, and 30 cycles of PCR were
performed using the PCR primer pair which originally
produced the expected 94 bp fragment (PCR was
performed according to the procedure described in the
GeneAmp kit [Cetus Perkin-Elmer]). Following
electrophoretic analysis of the PCR products, three
clones produced the expected 94 bp PCR product. These
three lambda clones were excised into a pBluescript
plasmid (Stratagene) according to the manufacturer's
protocol. The resulting transformants were then
screened by PCR. PCR was performed on single colony
transformants (100°C in 1X Taq reaction buffer for 10
minutes, then centrifuged briefly) followed by 30
cycles of PCR using the original primer pair and the
procedure described in the GeneAmp kit. Three
positive plasmid clones were identified from each
excision and large scale plasmid preparations were
performed on one colony from each clone.

The three bona fide polymerase clones were next
mapped. One had a relatively small 1500 bp insert.
The other two clone inserts were about 4500 basepairs;
one contained about 1000 bp of the Pfu polymerase
sequence while the other clone contained the entire
polymerase gene. The clone having the entire Pfu
polymerase I gene was named pF72.

WO 92/09689 PCT/US91/09026

The insert of clone pF72 was sequenced on both
strands using Sequenase™ (USB) and custom
oligonucleotide primers using a primer walking
strategy. The sequence of the polymerase gene is
shown in SEQ ID NO 2 from nucleotide bases 224 to
2548, and consists of a 2265 bp DNA segment encoding
775 amino acids corresponding to a 90,113 Dalton
protein.
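
The relationship between the quoted coordinates and the 775-residue protein can be checked with a short sketch. It is an illustration only: the codon table is the subset of the standard genetic code needed for the first 16 codons, the names are hypothetical, and `FIRST_48` is nucleotides 224 to 271 copied from SEQ ID NO 2 as printed in the listing.

```python
# Subset of the standard genetic code needed for the first 16 codons.
CODON_TO_RESIDUE = {
    "ATG": "Met", "ATT": "Ile", "ATA": "Ile", "TTA": "Leu",
    "GAT": "Asp", "GTG": "Val", "GTT": "Val", "TAC": "Tyr",
    "ACT": "Thr", "GAA": "Glu", "GGA": "Gly", "AAA": "Lys",
    "CCT": "Pro",
}

# Coding coordinates quoted in the text (1-based, inclusive).
CDS_START, CDS_END = 224, 2548

# Nucleotides 224 to 271 of SEQ ID NO 2, as printed in the listing.
FIRST_48 = "ATGATTTTAGATGTGGATTACATAACTGAAGAAGGAAAACCTGTTATT"


def translate(dna):
    """Translate complete codons of a DNA string into three-letter residues."""
    usable = len(dna) - len(dna) % 3
    return [CODON_TO_RESIDUE[dna[i:i + 3]] for i in range(0, usable, 3)]


codon_count, remainder = divmod(CDS_END - CDS_START + 1, 3)
residues = translate(FIRST_48)

print(codon_count, remainder)
print(" ".join(residues))
```

The translated residues can be compared with residues 1 to 16 of SEQ ID NO 1. The span 224 to 2548 divides into 775 whole codons, which agrees with the 775 amino acids stated above; the same arithmetic gives 2325 bases for that span, which does not match the 2265 bp figure quoted above. The CDS feature in the listing (224..2551) is three bases longer, and positions 2549 to 2551 read TAG in the listing, consistent with a stop codon.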

The Pfu gene can be cloned into an expression
system for production of the recombinant protein. The
complete coding region of the Pfu polymerase I gene is
PCR-amplified using Pfu polymerase to limit mutations
and the PCR product is ligated in reading
frame into the vector pRSET (Invitrogen). The ligated
vector is transformed into an F'-containing E. coli
host strain and protein production is induced with a
recombinant M13 phage expressing cloned T7 RNA
polymerase under control of the lacUV5 promoter. The
expression vector is designed to introduce an affinity
tail which can be used to facilitate purification of
the recombinant protein. Following cell lysis and
clarification of the crude supernatant, the
recombinant Pfu polymerase protein is isolated in a
single step by metal affinity chromatography.

In another strategy, the coding region of the Pfu
DNA polymerase gene is amplified with Pfu polymerase
using primers which introduce a unique restriction
site at the ends of the PCR product to facilitate
cloning into a pBluescript vector. When cloned into
the T3 orientation, the protein is expressed under
control of the lacZ promoter. Following expression,
recombinant Pfu polymerase is purified with a
modification of the procedure used to purify the
native enzyme. Briefly, a heat precipitation step is
employed following cell lysis and clarification. The
heat precipitation step denatures and precipitates the
majority of E. coli host proteins but not the
thermostable Pfu polymerase, and the remaining soluble
fraction is recovered and used as described before as
a source for purification of the Pfu polymerase I
protein.

The foregoing is intended as illustrative of the
present invention but not limiting. Numerous
variations and modifications can be effected without
departing from the true spirit and scope of the
invention.

PARTIAL SEQUENCE LISTING
(The remaining sequences are in the specification)

(1) GENERAL INFORMATION:

(i) APPLICANT: Mathur, Eric A

(ii) TITLE OF INVENTION: Purified Thermostable Pyrococcus
Furiosus DNA Polymerase I

(iii) NUMBER OF SEQUENCES: 2

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Bingham and Fitting
(B) STREET: 11230 Sorrento Valley Road, Suite 200
(C) CITY: San Diego
(D) STATE: CA
(E) COUNTRY: USA
(F) ZIP: 92121

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: US
(B) FILING DATE: 02-DEC-1991
(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/620,568
(B) FILING DATE: 03-DEC-1990

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/657,073
(B) FILING DATE: 19-FEB-1991

(vii) PRIOR APPLICATION DATA:
(A) APPLICATION NUMBER: US 07/776,552
(B) FILING DATE: 14-OCT-1991

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: Fitting, Thomas
(B) REGISTRATION NUMBER: 34,163
(C) REFERENCE/DOCKET NUMBER: STGOIOOP

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 619-587-3533
(B) TELEFAX: 619-587-0574

(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 775 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

Met Ile Leu Asp Val Asp Tyr Ile Thr Glu Glu Gly Lys Pro Val Ile
1               5                   10                  15

Arg Leu Phe Lys Lys Glu Asn Gly Lys Phe Lys Ile Glu His Asp Arg
            20                  25                  30

Thr Phe Arg Pro Tyr Ile Tyr Ala Leu Leu Arg Asp Asp Ser Lys Ile
        35                  40                  45

Glu Glu Val Lys Lys Ile Thr Gly Glu Arg His Gly Lys Ile Val Arg
    50                  55                  60

Ile Val Asp Val Glu Lys Val Glu Lys Lys Phe Leu Gly Lys Pro Ile
65                  70                  75                  80

Thr Val Trp Lys Leu Tyr Leu Glu His Pro Gln Asp Val Pro Thr Ile
                85                  90                  95

Arg Glu Lys Val Arg Glu His Pro Ala Val Val Asp Ile Phe Glu Tyr
            100                 105                 110

Asp Ile Pro Phe Ala Lys Arg Tyr Leu Ile Asp Lys Gly Leu Ile Pro
        115                 120                 125

Met Glu Gly Glu Glu Glu Leu Lys Ile Leu Ala Phe Asp Ile Glu Thr
    130                 135                 140

Leu Tyr His Glu Gly Glu Glu Phe Gly Lys Gly Pro Ile Ile Met Ile
145                 150                 155                 160

Ser Tyr Ala Asp Glu Asn Glu Ala Lys Val Ile Thr Trp Lys Asn Ile
                165                 170                 175

Asp Leu Pro Tyr Val Glu Val Val Ser Ser Glu Arg Glu Met Ile Lys
            180                 185                 190

Arg Phe Leu Arg Ile Ile Arg Glu Lys Asp Pro Asp Ile Ile Val Thr
        195                 200                 205

Tyr Asn Gly Asp Ser Phe Asp Phe Pro Tyr Leu Ala Lys Arg Ala Glu
    210                 215                 220

Lys Leu Gly Ile Lys Leu Thr Ile Gly Arg Asp Gly Ser Glu Pro Lys
225                 230                 235                 240

Met Gln Arg Ile Gly Asp Met Thr Ala Val Glu Val Lys Gly Arg Ile
                245                 250                 255

His Phe Asp Leu Tyr His Val Ile Thr Arg Thr Ile Asn Leu Pro Thr
            260                 265                 270

Tyr Thr Leu Glu Ala Val Tyr Glu Ala Ile Phe Gly Lys Pro Lys Glu
        275                 280                 285

Lys Val Tyr Ala Asp Glu Ile Ala Lys Ala Trp Glu Ser Gly Glu Asn
    290                 295                 300

Leu Glu Arg Val Ala Lys Tyr Ser Met Glu Asp Ala Lys Ala Thr Tyr
305                 310                 315                 320

Glu Leu Gly Lys Glu Phe Leu Pro Met Glu Ile Gln Leu Ser Arg Leu
                325                 330                 335

Val Gly Gln Pro Leu Trp Asp Val Ser Arg Ser Ser Thr Gly Asn Leu
            340                 345                 350

Val Glu Trp Phe Leu Leu Arg Lys Ala Tyr Glu Arg Asn Glu Val Ala
        355                 360                 365

Pro Asn Lys Pro Ser Glu Glu Glu Tyr Gln Arg Arg Leu Arg Glu Ser
    370                 375                 380

Tyr Thr Gly Gly Phe Val Lys Glu Pro Glu Lys Gly Leu Trp Glu Asn
385                 390                 395                 400

Ile Val Tyr Leu Asp Phe Arg Ala Leu Tyr Pro Ser Ile Ile Ile Thr
                405                 410                 415

His Asn Val Ser Pro Asp Thr Leu Asn Leu Glu Gly Cys Lys Asn Tyr
            420                 425                 430

Asp Ile Ala Pro Gln Val Gly His Lys Phe Cys Lys Asp Ile Pro Gly
        435                 440                 445

Phe Ile Pro Ser Leu Leu Gly His Leu Leu Glu Glu Arg Gln Lys Ile
    450                 455                 460

Lys Thr Lys Met Lys Glu Thr Gln Asp Pro Ile Glu Lys Ile Leu Leu
465                 470                 475                 480

Asp Tyr Arg Gln Lys Ala Ile Lys Leu Leu Ala Asn Ser Phe Tyr Gly
                485                 490                 495

Tyr Tyr Gly Tyr Ala Lys Ala Arg Trp Tyr Cys Lys Glu Cys Ala Glu
            500                 505                 510

Ser Val Thr Ala Trp Gly Arg Lys Tyr Ile Glu Leu Val Trp Lys Glu
        515                 520                 525

Leu Glu Glu Lys Phe Gly Phe Lys Val Leu Tyr Ile Asp Thr Asp Gly
    530                 535                 540

Leu Tyr Ala Thr Ile Pro Gly Gly Glu Ser Glu Glu Ile Lys Lys Lys
545                 550                 555                 560

Ala Leu Glu Phe Val Lys Tyr Ile Asn Ser Lys Leu Pro Gly Leu Leu
                565                 570                 575

Glu Leu Glu Tyr Glu Gly Phe Tyr Lys Arg Gly Phe Phe Val Thr Lys
            580                 585                 590

Lys Arg Tyr Ala Val Ile Asp Glu Glu Gly Lys Val Ile Thr Arg Gly
        595                 600                 605

Leu Glu Ile Val Arg Arg Asp Trp Ser Glu Ile Ala Lys Glu Thr Gln
    610                 615                 620

Ala Arg Val Leu Glu Thr Ile Leu Lys His Gly Asp Val Glu Glu Ala
625                 630                 635                 640

Val Arg Ile Val Lys Glu Val Ile Gln Lys Leu Ala Asn Tyr Glu Ile
                645                 650                 655

Pro Pro Glu Lys Leu Ala Ile Tyr Glu Gln Ile Thr Arg Pro Leu His
            660                 665                 670

Glu Tyr Lys Ala Ile Gly Pro His Val Ala Val Ala Lys Lys Leu Ala
        675                 680                 685

Ala Lys Gly Val Lys Ile Lys Pro Gly Met Val Ile Gly Tyr Ile Val
    690                 695                 700

Leu Arg Gly Asp Gly Pro Ile Ser Asn Arg Ala Ile Leu Ala Glu Glu
705                 710                 715                 720

Tyr Asp Pro Lys Lys His Lys Tyr Asp Ala Glu Tyr Tyr Ile Glu Asn
                725                 730                 735

Gln Val Leu Pro Ala Val Leu Arg Ile Leu Glu Gly Phe Gly Tyr Arg
            740                 745                 750

Lys Glu Asp Leu Arg Tyr Gln Lys Thr Arg Gln Val Gly Leu Thr Ser
        755                 760                 765

Trp Leu Asn Ile Lys Lys Ser
    770                 775

(2) INFORMATION FOR SEQ ID NO:2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3499 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(ix) FEATURE:
(A) NAME/KEY: 5'UTR
(B) LOCATION: 1..223

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 224..2551

(ix) FEATURE:
(A) NAME/KEY: 3'UTR
(B) LOCATION: 2552..3499

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:

CCCTGGTCCT GGGTCCACAT ATATGTTCTT ACTCGCCTTT ATGAAGAATC CCCCAGTCGC 60 

TCTAACCTGG GTTATAGTGA CAAATCTTCC TCCACCAGCG CCCAAGAAGG TTATTTCTAT 120 

CAACTCTACA CCTCCCCTAT TTTCTCTCTT ATGAGATTTT TAAGTATAGT TATAGAGAAG 180 

GTTTTATACT CCAAACTGAG TTAGTAGATA TGTGGGGAGC ATAATGATTT TAGATGTGGA 240 

TTACATAACT GAAGAAGGAA AACCTGTTAT TAGGCTATTC AAAAAAGAGA ACGGAAAATT 300

TAAGATAGAG CATGATAGAA CTTTTAGACC ATACATTTAC GCTCTTCTCA GGGATGATTC 360

AAAGATTGAA GAAGTTAAGA AAATAACGGG GGAAAGGCAT GGAAAGATTG TGAGAATTGT 420

TGATGTAGAG AAGGTTGAGA AAAAGTTTCT CGGCAAGCCT ATTACCGTGT GGAAACTTTA 480 

TTTGGAACAT CCCCAAGATG TTCCCACTAT TAGAGAAAAA GTTAGAGAAC ATCCAGCAGT 540 

TGTGGACATC TTCGAATACG ATATTCCATT TGCAAAGAGA TACCTCATCG ACAAAGGGCT 600 

AATACCAATG GAGGGGGAAG AAGAGCTAAA GATTCTTGCC TTCGATATAG AAACCCTCTA 660 

TCACGAAGGA GAAGAGTTTG GAAAAGGCCC AATTATAATG ATTAGTTATG CAGATGAAAA 720 

TGAAGCAAAG GTGATTACTT GGAAAAAGAT AGATCTTCCA TACGTTGAGG TTGTATCAAG 780 

CGAGAGAGAG ATGATAAAGA GATTTCTCAG GATTATCAGG GAGAAGGATC CTGACATTAT 840 

AGTTACTTAT AATGGAGACT CATTCGACTT CCCATATTTA GCGAAAAGGG CAGAAAAACT 900 

TGGGATTAAA TTAACCATTG GAAGAGATGG AAGCGAGCCC AAGATGCAGA GAATAGGCGA 960 

TATGACGGCT GTAGAAGTCA AGGGAAGAAT ACATTTCGAC TTGTATCATG TAATAACAAG 1020 

GACAATAAAT CTCCCAACAT ACACACTAGA GGCTGTATAT GAAGCAATTT TTGGAAAGCC 1080 

AAAGGAGAAG GTATACGCCG ACGAGATAGC AAAAGCCTGG GAAAGTGGAG AGAACCTTGA 1140 

GAGAGTTGCC AAATACXCGA TGGAAGATGC AAAGGCAACT TATGAACTCG GGAAAGAATT 1200 

CCTTCCAATG GAAATTCAGC TTTCAAGATT AGTTGGACAA CGTTTATGGG ATGTTTCAAG 1260 

GTCAAGCACA GGGAACCTTG TAGAGTGGTT CTTACTTAGG AAAGCCTACG AAAGAAACGA 1320 

AGTAGCTCCA AACAAGCCAA GTGAAGAGGA GTATCAAAGA AGGCTCAGGG AGAGCTACAC 1380 

AGGTGGATTC GTTAAAGAGG CAGAAAAGGG GTTGTGGGAA AACATAGTAT ACCTAGATTT 1440 

TAGAGCCCTA TATCCCTCGA TTATAATTAC CCACAATGTT TCTCCCGATA CTCTAAATCT 1500 

TGAGGGATGC AAGAACTATG ATATCGCTCC TCAAGTAGGC CACAAGTTCT GCAAGGACAT 1560 

CCCTGGTTTT ATACCAAGTG TCTTGGGACA TTTGTTAGAG GAAAGACAAA AGATTAAGAC 1620 

AAAAATGAAG GAAACTCAAG ATCCTATAGA AAAAATACTC CTTGACTATA GACAAAAAGC 1680 

GATAAAACTC TTAGCAAATT CTTTCTACGG ATATTATGGC TATGCAAAAG CAAGATGGTA 1740 

CTGTAAGGAG TGTGCTGAGA GCGTTACTGC CTGGGGAAGA AAGTACATCG AGTInGTATG 1800 

GAAGGAGCTC GAAGAAAAGT TTGGATTTAA AGTCCTCTAC ATTGACACTG ATGGTCTCTA 1860 

TGCAACTATC CCAGGAGGAG AAAGTGAGGA AATAAAGAAA AAGGCTCTAG AATTTGTAAA 1920

ATACATAAAT TCAAAGCTCC CTGGAGTGCT AGAGGTXGAA TATGAAGGGT TTTATAAGAG 1980

GGGATTCTTC GTTACGAAGA AGAGGTATGC AGTAATAGAT GAAGAAGGAA AAGTCATTAC 2040

TCGTGGTTTA GAGATAGTTA GGAGAGATTG GAGTGAAATT GCAAAAGAAA CTCAAGCTAG 2100

AGTTTTGGAG ACAATACTAA AACACGGAGA TGTTGAAGAA GGTGTGAGAA TAGTAAAAGA 2160

AGTAATACAA AAGCTTGCCA ATTATGAAAT TCCACGAGAG AAGCTCGCAA TATATGAGCA 2220

GATAACAAGA CCATTACATG AGTATAAGGC GATAGGTCCT CAGGTAGCTG TTGCAAAGAA 2280

ACTAGCTGCT AAAGGAGTTA AAATAAAGCC AGGAATGGTA ATTGGATAGA TAGTACTTAG 2340

AGGCGATGGT CCAATTAGCA ATAGGGCAAT TCTAGCTGAG GAATACGATC CCAAAAAGCA 2400

CAAGTATGAC GCAGAATATT ACATTGAGAA CGAGGTTGTT CGAGCGGTAC TTAGGATATT 2460

GGAGGGATTT GGATACAGAA AGGAAGACCT CAGATACGAA AAGACAAGAC AAGTCGGCCT 2520

AACTTCCTGG CTTAACATTA AAAAATCCTA GAAAAGCGAT AGATATCAAC TTTTATTCTT 2580

TGTAACCTTT TTCTATGAAA GAAGAACTGA GCAGGAATTA CGAGTTCTTG CGTTATTTTA 2640

TGGGTAATTA AAAAGCCATG CTCTTGGGAG AATCTTGGAA TAAAATCCCT AACTTCAGGC 2700

TTTGCTAAGT GAATAGAATA AACAACATGA CTCACTTGAA ACGGCTTCGT TAGAAATGGT 2760

CTATCTGCAT GCTTCTGTGG CTCGGAAKNG GAGGATTCAT AAGAAGAGTA TCAACATTCT 2820

CAGAGAATTG AGAAAGATCA GAAAGTTTGA CTTCTACAAG ATTTCTAACT TTGCAACTCT 2880

TCAAGATTTT CTAAAAGAAT TTTAAGGGCC TCCTCGTGAA TTTCGACGAC GTAGATCTTT 2940

TTTGCTCCAA GCAGAGCGGG TGGAATGGAT AACAGGCGTG TTCCCGCACC CAAGTCCGCT 3000

ACAATTTTTT CCTTGTATCT CGTAATGTAT AAGCAAGCGA AAGGAGAGTA GATGCTACCT 3060

TTCCGGGAGT TTTGTATTGG TGTAGGCAAG GTTTGGGATT TTTGAATCCT TTAACTCTGG 3120

AAAGTATAAT TTCAAGCTCC TTCTTCTTCA TGACAGATGA AAAATTGTTT TGTCTCTTTT 3180

TAACTTTTAC AGAAATAACT GTCTGAAATT ATGACAACTG TTGACATTTT TACTTCATTA 3240

CCAGGGTAAT GTTTTTAAGT ATGAAATTTT TCTTTCATAG AGGAGGNNNN NNGTCCTCTC 3300

CTCGATTTCC TTGGTTGTGC TGCATATGAT AAGCTTCCAA AGTGGGTGTT GAGAGTTTTA 3360

GACAGTCAAA TAGCAGACGA CAATGGTGTG CTCACTCAAG CCCCATATGG GTTGAGAAAA 3420

GTAGAAGCGG CACTACTGAG ATGCTTGCGC AGGAATGAGG TTGTTGTAGC TCNTCCGNGA 3480

AAGATTGAGA TGTTCTTGG 3499

What is Claimed is:

1. Purified thermostable Pyrococcus furiosus
DNA polymerase I having an amino-terminal amino acid
residue sequence represented by the formula shown in
SEQ ID NO 1 from residue 1 to residue 775.

2. The polymerase of claim 1 wherein said
polymerase migrates on a non-denaturing polyacrylamide
gel faster than phosphorylase B and Taq polymerase and
more slowly than bovine serum albumin and has an
estimated molecular weight of 90,000-93,000 daltons
when compared with a Taq polymerase standard assigned
a molecular weight of 94,000 daltons.

3. The polymerase of claim 1 that is isolated
from Pyrococcus furiosus.

4. The polymerase of claim 1 that is isolated
from a recombinant organism transformed with a vector
that codes for the expression of Pyrococcus furiosus
DNA polymerase.

5. A plasmid containing a gene coding for a
Pyrococcus furiosus DNA polymerase I having an
amino-terminal amino acid residue sequence represented by
the formula shown in SEQ ID NO 1 from residue 1 to
residue 775.

6. The plasmid of claim 5 wherein said
polymerase has a molecular weight of about 90,000 to
93,000 daltons as determined by SDS-PAGE under
non-reducing conditions using Taq polymerase as a
molecular weight marker having an assigned molecular
weight of 94,000 daltons.

7. The plasmid of claim 5 wherein said gene is
operably linked to a promoter.

8. A procaryotic cell transformed with the
plasmid of claim 5.

9. A bacteriophage containing a gene coding for
a Pyrococcus furiosus DNA polymerase I having an
amino-terminal amino acid residue sequence represented
by the formula shown in SEQ ID NO 1 from residue 1 to
residue 775.

10. The bacteriophage of claim 9 wherein said
polymerase has a molecular weight of about 90,000 to
93,000 daltons as determined by SDS-PAGE under
non-denaturing conditions using Taq polymerase as a
molecular weight marker having an assigned molecular
weight of 94,000 daltons.

11. The bacteriophage of claim 9 wherein said
gene is operatively linked to a promoter.